Wikipedia talk:Verifiability

The project page associated with this talk page is an official policy on Wikipedia. Policies have wide acceptance among editors and are considered a standard for all users to follow. Please review policy editing recommendations before making any substantive change to this page. Always remember to keep cool when editing, and don't panic.

See WP:PROPOSAL for Wikipedia's procedural policy on the creation of new guidelines and policies. See how to contribute to Wikipedia guidance for recommendations regarding the creation and updating of policy and guideline pages.

Archives

Index 1, 2, 3, 4, 5, 6, 7, 8, 9, 10
11, 12, 13, 14, 15, 16, 17, 18, 19, 20
21, 22, 23, 24, 25, 26, 27, 28, 29, 30
31, 32, 33, 34, 35, 36, 37, 38, 39, 40
41, 42, 43, 44, 45, 46, 47, 48, 49, 50
51, 52, 53, 54, 55, 56, 57, 58, 59, 60
61, 62, 63, 64, 65, 66, 67, 68, 69, 70
71, 72, 73, 74, 75, 76, 77, 78, 79, 80
81

This page has archives. Sections older than 7 days may be automatically archived by when more than 3 sections are present.

"Threshold" again

Threshold is a word of many meanings, but the nearest relevant one, following wiktionary, is

(engineering) The quantitative point at which an action is triggered, especially a lower limit

To say that verifiability is the threshold for inclusion therefore implies that it triggers inclusion, and that everything verifiable is included. This would be a ridiculous proposition, never followed in practice, so it's a bit silly to state it thus. SamuelTheGhost (talk) 13:07, 19 December 2010 (UTC)[reply]

Well, from an engineering standpoint, your deduction is not 100% correct. A trigger can be what normally causes the event to happen, but still subject to other criteria. Trigger/gun is a great analogy...pulling the trigger normally fires it, but only if the safety is flipped off. But I agree that the term is ambiguous in an area where it is important that it be un-ambiguous. Also, the inclusion of an example ("truth") in that sentence of something that does not trump the requirement is also confusing and problematic. North8000 13:54, 19 December 2010 (UTC)

[inserted later] The definition for "threshold", that SamuelTheGhost gave in the first post, is definition 4 from wiktionary.org. From the viewpoint of definition 4, crossing a "threshold" triggers an action. Conversely, if an action was not triggered, then the threshold was not crossed. With this reading, if pulling a trigger does not fire the gun, then pulling the trigger is not a "threshold". So I think that SamuelTheGhost was precise, but more to the point, I think that we should agree that definition 4 from Wiktionary is not the intent of the first sentence of WP:V. RB 66.217.118.46 (talk) 19:58, 22 December 2010 (UTC)[reply]

[insert ends here]

Can you clarify what is ambiguous in the current version? --Nuujinn (talk) 14:02, 19 December 2010 (UTC)[reply]

Verifiability is the first of several thresholds for inclusion. But it is not the only threshold for inclusion. The fact that something is verifiable means that it passes the first test and may potentially be included (it also has to pass the other thresholds). However, because Verifiability is the first threshold, it does mean that anything that is not verifiable may be excluded. Blueboar (talk) 14:59, 19 December 2010 (UTC)[reply]

Its never implied (to me at least) that its the only threshold. AaronY (talk) 15:41, 19 December 2010 (UTC)[reply]

I'll use terms from normal logic. Verifiability, the way policy is worded, is necessary but not sufficient. Material also has to be notable, and there are also some special rules for special cases.--Andrew Lancaster (talk) 16:55, 19 December 2010 (UTC)[reply]

I think the best other requirement to consider as proof that not all verifiable material should be included is that a consensus must be reached among the article editors that material is interesting enough to include in an encyclopedia. Notability applies to an article as a whole, but the interesting criteria applies to each claim within an article. Jc3s5h (talk) 17:09, 19 December 2010 (UTC)[reply]

Bearing in mind that this is the change I wanted to introduce, it seems to me that everybody agrees that it says the right thing, some people agree that it's an improvement, and some people think that it's unnecessary because the existing text already says that, but nobody thinks that what I've proposed is actually wrong. SamuelTheGhost (talk) 17:20, 19 December 2010 (UTC)[reply]

Thanks for the patronization Alex. I guess what I'm saying is that its obvious to me if Bill Clinton attends a softball game in Peoria and it makes the local paper it doesn't merit inclusion in his article, even though technically this guideline says that it does. I don't care if the wording is changed, but I've never for a second thought it meant anything with a source gets in. Its common sense that it wouldn't. AaronY (talk) 17:47, 19 December 2010 (UTC)[reply]

This guideline sets the minimum for inclusion while WP:NPOV and WP:NOTABILTY are for weight and relevance. Just because we don't summarize those policies here (which we could) doesn't suggest anyone take the policy in isolation (in fact, that's explicitly cautioned against). Ocaasi (talk) 19:10, 19 December 2010 (UTC)[reply]

I don't see a need to change the language... The intent of the statement is to outline what may and may not be added. If no reliable sources verify Clinton's attendance at the softball game, we may not mention it (even if his attendance is "true"). If reliable sources verify his attendance then we may mention it... but that permission does not mean we must mention it. I think this is clear in the current language. Blueboar (talk) 18:37, 19 December 2010 (UTC)[reply]

The previous time these issues came up we ran into a language problem with both the word "notability" and the relation of WP:N to content policy. As noted by Wtmitchell [here], WP:N is not part of content policy. WP:NNC states, "The question of content coverage within a given page is governed by the principle of due weight and other content policies." At the same time, the word "notability" is frequently used on this page as it relates to content. It seems that the word used in the WP:WEIGHT policy is "prominence". Since I didn't like the word "prominence" in this context, I started referring to "notability/prominence". FYI, RB 66.217.118.112 (talk) 23:27, 19 December 2010 (UTC) [inserted later] See also: Wikipedia:Notability vs. prominence. RB 66.217.117.192 (talk) 15:16, 23 December 2010 (UTC) [insert ends here][reply]

Here is another definition of "threshold", the point being that a threshold isolates two states:

www.merriam-webster.com threshold 3b : a level, point, or value above which something is true or will take place and below which it is not or will not.

A threshold for inclusion is therefore also a threshold for the absence of inclusion. This is technically an ambiguity, where "ambiguity" is defined in www.merriam-webster.com as ambiguity 1b : "A word or expression that can be understood in two or more possible ways".

The problem for policy arises in considering verifiable material for the "absence of inclusion". WP:V says that editors cannot consider material that "could be not true" for the "absence of inclusion". RB 66.217.118.112 (talk) 02:15, 20 December 2010 (UTC)[reply]

There's also WP:Editorial discretion. Editors are expected, even required, to use their best judgment. WhatamIdoing (talk) 05:45, 20 December 2010 (UTC)[reply]

The "threshold for inclusion" implies a necessary, not a sufficient, condition. If the threshold for your inclusion at a party is that you bring the hostess some wine, it doesn't mean you'll be let in if you arrive with the wine, but also drunk and covered in mud. But there are times when the existence of a reliable source would be a sufficient condition too: if the article is underdeveloped, for example, or if the point is one required for NPOV.

We should be careful not to add anything to the policy that editors could use to reject reliable sources, because everything depends on context. What the policy currently implies is that if you arrive at an article with a good source, there has to be a strong editorial reason to keep your material out. SlimVirgin ^{talk|contribs} 06:26, 20 December 2010 (UTC)[reply]

If a reliable source has printed a mistake, WP:V policy should not require editors to ignore the error. Under WP:V, editors cannot consider a retraction by the newspaper for the story about the Clinton softball game, because WP:V is "whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think it is true." WP:V is currently a force to include mistakes in the encyclopedia, and this force has no documented purpose.

From WP:V:

See also, argument from authority.
Notes, 4. "Wales, Jimmy (16 May 2006). "Zero information is preferred to misleading or false information...I can NOT emphasize this enough."

RB 66.217.117.29 (talk) 22:29, 20 December 2010 (UTC)[reply]

*WhatamIdoing, Wikipedia:Editorial discretion has the status of an "essay", it says an essay may be a "minority viewpoint", and "Consider these views with discretion." Discretion in this case can include citing WP:V the way it is currently written, which is that it is incorrect to challenge verifiable fallacies; since Wikipedia is about verifiability, not the truth of the verifiable statement. RB 66.217.118.27 (talk) 19:29, 21 December 2010 (UTC)[reply]

With all of it's wording shortcomings, I think that it's clear that meeting wp:ver is a requirement for inclusion, not something that mandates inclusion.

It easy and common for a wp:"RS" to be unreliable, (wp:rs has no criteria for knowledge of the subject or objectivity) and to have information that is patently wrong, or just one of many conflicting opinions. More commonly, people misuse what's in a RS. For example, The New Your Times covers John Smith's claim that the earth is flat. Then they use the NYT as a source for the statement that the earth is flat. Either way, nothing in wp:ver mandates inclusion of anything. North8000 21:00, 21 December 2010 (UTC)

If the consensus of editors is to omit something... just omit it. The "Threshold" clause allows for this... and if that is not enough, you can always invoke WP:IAR. If, on the other hand, the editors are debating whether to include something, then we would need to know what the something is before we can address the issue further. The simple fact is, we do not require that everything verifiable be added to an article... but we do require that anything added be verifiable. Blueboar (talk) 22:54, 21 December 2010 (UTC)[reply]

I agree that WP:V is "not something that mandates inclusion", that "we do not require that everything verifiable be added", and "we do require that anything added be verifiable.

We've seen that the word "threshold" is itself ambiguous, as two definitions have been given and are in use. We've seen that there is a connotation of "threshold" that is the "threshold for the absence of inclusion". No one has offered a purpose for having a "threshold for the absence of inclusion". I agree with what Blueboar said, "The intent of the statement is to outline what may and may not be added." I submit that this proposal restores that intent, and that in the absence of further analysis, consensus exists to restore that intent. RB 66.217.118.91 (talk) 01:43, 22 December 2010 (UTC)[reply]

I don't think it is needed. The intent is already clear in the old language. Blueboar (talk) 02:15, 22 December 2010 (UTC)[reply]

BTW, before I said I thought there was a consensus, I concluded you were using SamuelTheGhost's definition of "threshold", and that my response would clarify that the word "threshold" is confounded. The question becomes, even if you don't see the need, and think the intent is clear in the old language, is that an objection to a change which to you is not a change? RB 66.217.118.91 (talk) 02:42, 22 December 2010 (UTC)[reply]

Blueboar, I'll take your silence as a "rhetorical answer". I'll also withdraw my thought that you were using SamuelTheGhost's definition, i.e., Wiktionary definition 4, of "threshold". At this point, I don't know what definition of "threshold" you meant when you said in your recent post, "The "Threshold" clause allows for this..." RB 66.217.118.46 (talk) 20:07, 22 December 2010 (UTC)[reply]

In summary, I think that "threshold" is ambiguous, and, per it's common use, would lead towards the wrong impression. I lost track of what the proposal is, but "requirement" or "condition" would be much better (and less ambiguous) words. And, including an example of one thing ("truth") that does not override the condition, and a poor choice of a word is problematic on two levels. One, it confuses the main/core statement of wp:ver, and secondly it leads to wide mis-quoting that wp:ver states that accuracy (in cases where objective accuracy exists) is not an objective of Wikipedia. So I think the "not truth" should also go. North8000 14:59, 23 December 2010 (UTC)

I agree with most editors here that the purpose of the "threshold" language is to say that it's not enough for something to be true, it must also be verifiable -- and even that may not be enough.

The main problem is that the principle, especially in the abbreviated form in which it is usually quoted ("verifiability not truth"), can be misunderstood as follows: We don't care whether something is true or not. We just check whether it's verifiable, and that's it.

This misunderstood version is actually convenient when dealing with fringers who claim that established and verifiable facts are false. (E.g.: "It is not true that Obama was born in Hawaii, so we can't say it." Response: We don't care whether it's true or not, only whether it's verifiable.) As a result, we have a significant number of experienced editors, including at least one high-profile admin, who believe this is the correct interpretation.

In the past we have had a number of situations in which there was a strong consensus among Wikipedia editors that all reliable sources that reported something were in fact wrong. These situations tend to attract editors who then claim that we are obliged to parrot the incorrect reporting as if it was true, because truth simply does not matter. This is particularly egregious in BLP cases and in cases such as the Sam Blacketer controversy article, which are about events that happened at Wikipedia. In these cases we are in the best situation to report the correct facts, and readers including journalists turn to us to learn about them, not for our parrotting of news sources without any warning that we know it's all misinformation.

For this reason I think the "threshold" wording needs tuning. However, the proposed change is certainly not optimal for this purpose. Hans Adler 15:33, 23 December 2010 (UTC)[reply]

How about

Verifiability is an absolute requirement for inclusion of material in Wikipedia. No other consideration (such as assertions of truth) is a substitute for meeting the verifiability requirement. North8000 15:57, 23 December 2010 (UTC)

I used "assertions of" in the example so as to not disparage the concept of accuracy. North8000 15:57, 23 December 2010 (UTC)

Perhaps Hans Adler would like to tell us what wording he would suggest? SamuelTheGhost (talk) 17:16, 23 December 2010 (UTC)[reply]

Look, the problem on this page (and so many other policy pages) is that the original text was intended to be evocative of a principle, but many editors try to read it as literal law in order to use it as a hammer in particular disputes they are engaged in. The solution to the problem is to get editors to think about the bigger picture. Twiddling with the wording is not really going to help much.

That being said, the idea this line is pointing to is that Wikipedia editors should not be trying to evaluate the ontological truth-status of various perspectives on a topic. What we have, instead, is an epistemological problem: How do we know which statements about a topic qualify as knowledge that should be included in the encyclopedia, and what kind of knowledge do those statements represent? The answer to the first question is that statements qualify for inclusion when they can be verified - i.e., when we can find credible published sources that make the statement in question. The answer to the second question relies on a judgement of whether the statement is a commonly accepted truism, an established perspective in a debate, a dated historical claim, the opinion of an individual, or etc. - in other words, how does the statement fit into the greater social and scholarly worlds. We are not trying to establish what is true about a topic, we are trying to present a snapshot of the prevalent beliefs about the topic.

I could propose better wording, but it's christmas ansd I have better things to do today.

--Ludwigs2 13:56, 25 December 2010 (UTC)[reply]

Very nicely said, Ludwigs. SlimVirgin ^{talk|contribs} 12:37, 28 December 2010 (UTC)[reply]

Perhaps Ludwigs2 might demonstrate that he "could propose better wording" some time before next Christmas? SamuelTheGhost (talk) 14:42, 4 January 2011 (UTC)[reply]

ok, but you're going to be disappointed next Christmas when there's nothing under the tree.

how about this? This would replace the entire intro:

As an encyclopedia, Wikipedia needs to ensure that statements made in Wikipedia articles accurately reflect the knowledge and information that exist in the greater world. The key to this is Verifiability - literally, the ability to cite a published source in the greater world which makes a similar statement in the correct context, so that any reader can verify that Wikipedia or its editors are not rendering their own opinions. Knowledge and information are not the same as 'truth', 'factuality', or 'correctness'. In some cases, important knowledge or information about a topic may be factually wrong, may contradict scientific evidence, may be the opinion of a person or group, or may even be overtly offensive or deeply polemical. As long as such statements can be verified, they can be used in the encyclopedia.
Verifiability is not sufficient in itself for including something in the encyclopedia, but needs to be used in combination with other content policies - such as Neutral Point of View - and with common sense. Context is the key issue here: users must verify not just that a given source says something, but also that the source is directly discussing the topic at hand and is qualified to discuss that topic, that the statement is a significant viewpoint in the context in which it is being used on Wikipedia, and that the statement is being used here in a way that is consistent with the way the source uses the statement in the greater world.
In some cases, statements are so self-evident that verifiability is assumed without need for actual citation. If someone asks for verification on a statement like "the sky is blue", it is appropriate to ask why such verification is needed.
This policy applies to all material that is considered part of the encyclopedia, including article text, article titles and section headings, lists, images, quotes, templates, and captions. It is applied very stringently to biographies of living persons.

.

I'm very sympathetic to that formulation, although it's quite a big change from what's there now. Two small comments:

You're a bit addicted to "in the greater world"
Things aren't "self-evident", but citation may be omitted for uncontroversial statements for which sources of verification are widely available, such as "most people have two legs" or "Manhattan is an island". (The sky often isn't blue) SamuelTheGhost (talk) 17:49, 4 January 2011 (UTC)[reply]

I like the tone, but I also have misgivings. One additional bit with which I would take issue is the assertion that "users must verify ... that the source is ... qualified to discuss that topic". No. Besides the special case of qualifying a recognized topic expert for acceptability as a SPS, I can't think of a circumstance in which editors (as distinct from users) need to verify (or, indeed, should make a judgment about) source qualifications. A rather extreme example would be qualification of e.g., Danny Glover or Rosie O'Donnell as acceptable sources for comments on world politics. Somewhat related to this is concern over the ambiguity of the term "source" -- If A publishes info that B asserted X about topic Y, then (presuming that A is a reliable source on the topic), is the degree of qualification of either A or B to discuss the topic a matter which requires editorial judgment by individual WP editors? Wtmitchell (talk) (earlier Boracay Bill) 22:00, 5 January 2011 (UTC)[reply]

"A threshold" vs. "The threshold"

Some editors have posted that there is more than one threshold. Yet the first word of WP:V is not "a" but "the". I support having one requirement for inclusion, the point being that the other policies are for exclusion. I've tried to draw a word picture in the essay [Policy_sculpting:_inclusion_versus_exclusion]. FYI, RB 66.217.118.63 (talk) 09:55, 25 December 2010 (UTC)[reply]

In that we are discussing "a" vs. "the" currently, please see the essay [WP:Policy_sculpting:_inclusion_versus_exclusion] and the talk page [WT:Policy_sculpting:_inclusion_versus_exclusion]. RB 66.217.118.148 (talk) 06:25, 9 January 2011 (UTC)[reply]

"condition" vs. "requirement"

These two definitions are from m-w.com

Definition of REQUIREMENT

1 : something required: b : something essential to the existence or occurrence of something else : condition <failed to meet the school's requirements for graduation>

Definition of CONDITION

2 : something essential to the appearance or occurrence of something else : prerequisite

I prefer "requirement" to "condition". RB 66.217.118.116 (talk) 01:19, 27 December 2010 (UTC)[reply]

"threshold" vs. "requirement"

"threshold" (1) No one has provided a definition that explains why we are using the word. (2) No one has disputed that one of the connotations of "threshold" is that there is a "threshold for the absence of inclusion". (3) No one supports having a "threshold for the absence of inclusion". In short, "threshold" is ambiguous.

"requirement" removes the ambiguity without changing the intent.

Proposal:

The ~~threshold~~ for inclusion in Wikipedia is verifiability, not truth; that is, whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think it is true.

RB 66.217.118.116 (talk) 02:07, 27 December 2010 (UTC)[reply]

Proposal: How about we kill three birds with one stone? For the reasons discussed above, proposal as follows:

Replace:

The threshold for inclusion in Wikipedia is verifiability, not truth; that is, whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think it is true.

With

The requirement for inclusion in Wikipedia is verifiability; that is, whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think it is true.

Sincerely, North8000 (talk) 12:24, 28 December 2010 (UTC)[reply]

It's not "the requirement." It's "the threshold," an entirely different concept. This is a very commonly accepted phrase now, and I think people do understand what it means given how often I see it being cited correctly. SlimVirgin ^{talk|contribs} 12:40, 28 December 2010 (UTC)[reply]

I think it is reasonable to say that verifiability is required, and not use the word "threshold". Am I missing something? I also have a problem with using an undocumented definition of "threshold". The ambiguity that we are currently trying to resolve includes removing the "threshold for the absence of inclusion".

Also, do you think these editors are citing correctly?

Finally, it's irrelevant if that statement you mentioned wasn't "true". Wikipedia represents verifiability, not truth. We simply present the information given by reliable sources. (Ref: [here]).

...("verifiability not truth"), can be misunderstood as follows: We don't care whether something is true or not. We just check whether it's verifiable, and that's it. (Ref: [here]).

...editors who then claim that we are obliged to parrot the incorrect reporting as if it was true, because truth simply does not matter. (Ref: [here]).

RB 66.217.117.121 (talk) 01:22, 2 January 2011 (UTC)[reply]

Hello SlimVirgin. "Requirement" simply says that it's necessary. IMHO "Threshold" doesn't have a single unambiguous meaning, but most commonly means "necessary and sufficient" (which is essentially a "shall be included irrespective of other considerations" statement) which IMHO is not correct for Wikipedia. That reason was the main thread of this discussion section, but was not my main motivation, which was to solve the problem that the current wording is often misinterpreted to say that accuracy (in cases where it objectively exists) is not an objective and not valued. Sincerely, North8000 (talk) 03:00, 2 January 2011 (UTC)[reply]

North8000, I agree with your last statement, and I am aware that fixing the ambiguity with "threshold" does not solve my problem either. But it helps. I think that with the absence of responses in opposition, and the clarity in the definition of "requirement"; we are at a point of consensus to make the limited one-word change. RB 66.217.117.60 (talk) 21:38, 6 January 2011 (UTC)[reply]

A tinier change than I was hoping for, but I think it's good. North8000 (talk) 21:50, 6 January 2011 (UTC)[reply]

I have a simpler solution: We should change the sentence to read, "The minimum threshold for inclusion..."

That's what we mean, after all: Information that can be verified may (not must) be included; information that cannot be verified may not be included. WhatamIdoing (talk) 05:08, 7 January 2011 (UTC)[reply]

I'm not sure that the term "minimum threshold" would withstand logical dissection, but I agree that it does convey what we mean.North8000 (talk) 12:11, 7 January 2011 (UTC)[reply]

I think we should put minimum threshold in the garbage receptacle for trash. If we need to adjective-ize it to make sense, we've probably chosen the wrong word. I like your suggestions, but this one seems like the wrong fix. Ocaasi (talk) 16:10, 7 January 2011 (UTC)[reply]

OK then, is anybody strongly opposed to "requirement"? If not I'd be willing to be bold (ok, semi-bold :-) ) and try putting it in. North8000 (talk) 19:35, 7 January 2011 (UTC)[reply]

I would not revert the change... but I really really really prefer "threshold". I think it is the best word to express what we mean (a line you must cross, the barrier that separates "outside" from "inside"). Blueboar (talk) 20:35, 7 January 2011 (UTC)[reply]

This isn't just one word like I thought. We can't just sub the one word because because then it would say "the requirement" which says that there is (only) one requirement. And so if we say "a requirement", now we are making a new (arguable) statement that truth is not a requirement, i.e "upgrading" that from an implication to a statement. IMHO the real fix would be my proposal:

"Verifiability is an absolute requirement for inclusion of material in Wikipedia. No other consideration (such as assertions of truth) is a substitute for verifiability."

which we haven't talked about much. I don't plan putting anything in at this time. Sincerely, North8000 (talk) 03:03, 8 January 2011 (UTC)[reply]

"requirement" vs. "absolute requirement"

To me the word "absolute" is verbiage. Also there is some text that has been removed without an explanation. Here is an alternate to North8000's proposal:
Was:

The threshold for inclusion in Wikipedia is verifiability, not truth; that is, whether readers can check that material in Wikipedia has already been published by a reliable source, not whether editors think it is true.

New:

"Verifiability is required for inclusion of material in Wikipedia; that is, whether readers can check that material in Wikipedia has already been published by a reliable source. No other consideration, such as assertions of truth, is a substitute for verifiability."

RB 66.217.118.148 (talk) 07:48, 9 January 2011 (UTC)[reply]

Your version is better and corrects my omission. I know "absolute" is technically redundant; I had it in for emphasis, but it's probably better to leave it out. North8000 (talk) 14:22, 9 January 2011 (UTC)[reply]

I still think threshold is better, but seem some improvement in the latter parts of the phrase. If it were up to me I'd use: "The threshold for inclusion in Wikipedia is verifiability, that is, whether readers can check that material in Wikipedia has already been published by a reliable source. No other consideration, such as assertions of truth, is a substitute for verifiability." FWIW. --Nuujinn (talk) 18:06, 9 January 2011 (UTC)[reply]

Than takes care of my main concern and reason that I consider this to be a big improvement. (previously discussed) Other folks were more concerned about the requirement vs. threshold. But I do think that requirement is less ambiguous than threshold. North8000 (talk) 18:35, 9 January 2011 (UTC)[reply]

I think this is an improvement over the current policy statement. It might even help clarify the "requirement" vs. "threshold" discussion. RB 66.217.117.27 (talk) 23:51, 9 January 2011 (UTC)[reply]

Personal communications

I've added a brief section about "personal communication" at Wikipedia:Verifiability#Personal_communication. There's a lot of examples of newbies using "personal communications" with authorities and contacts (which I started to clean up), but I couldn't see that we had an explicit statement against doing this - we don't allow it as nobody can check the source. It's a form of original research, so I would understand if it were best explained in that policy - or it could be mentioned in both. p.s. Could WP:OTRS in theory verify personal communications? Fences&Windows 23:02, 30 December 2010 (UTC)[reply]

IP addresses can't edit the Project Page, or I'd do this myself. The word "conservations" s.b. "conversations". FYI, RB 66.217.118.34 (talk) 04:27, 2 January 2011 (UTC)[reply]

Good catch. Fixed. -- Brangifer (talk) 05:18, 2 January 2011 (UTC)[reply]

This is definitely a good elaboration of policy. One thing I might add, however, would be a note clarifying whether it's OK to cite an otherwise reliable source for information which itself cites a personal communication for that information. For instance, Wikipedia article A says "John Smith was born in 1949", citing book B, which indicates that John Smith was born in 1949 and contains a footnote citation referencing a personal communication. —/Mendaliv/^2¢/_Δ's/ 19:48, 2 January 2011 (UTC)[reply]

I think the new section makes that fairly clear by stating that personal communications are cited in scholary works. The verifiability policy only applies to Wikipedia editors; authors of reliable sources need not pay any attention to it. Jc3s5h (talk) 20:36, 2 January 2011 (UTC)[reply]

I don't see this as needed. The whole of the rest of the policy makes clear that we need published sources, and as it was written it implied that personal communication even when published was not allowed, which of course it is. Fences and windows, do you have diffs of new editors relying on personal exchanges with people? SlimVirgin ^{talk|contribs} 03:49, 3 January 2011 (UTC)[reply]

Follow the "a lot of examples" link I gave, and browse through. There's many examples, and this search will only pick up those using the phrase "personal communication", there will be others using other wording. As "personal communication" is a convention elsewhere and editors do make this mistake, it is worth clarifying this point. Fences&Windows 03:21, 7 January 2011 (UTC)[reply]

Examples: Amore Bekker#cite note-name31-8, George R. Fischer#cite note-3. Fences&Windows 03:23, 7 January 2011 (UTC)[reply]

I have to agree with F&W here... I think this is a good addition to the policy. Blueboar (talk) 03:59, 7 January 2011 (UTC)[reply]

I disagree. It's at best pointless, because the rest of the policy makes clear that sources must be published, and the part about posting it on talk pages encourages editors to post other people's e-mails, which is not a good idea. SlimVirgin ^{talk|contribs} 04:54, 7 January 2011 (UTC)[reply]

The two most recent editors make it evident that the personal communications section is easy to misinterpret, as evidenced by Blueboar's misinterpretation. The current policy, which is not explicitly written but is implicit from the policy, has these points:

Reliable sources are free to receive personal communications from anyone, especially other scholars, and to make use of those personal communications in their work. Because these sources are reliable, they can be trusted to verify that the communication really came from the stated source and evaluate the correctness and importance of the personal communication.
Wikipedia editors are free to cite reliable sources which make use of personal communications received by the reliable source.
Wikipedia editors are not permitted to cite personal communications that the WP editor claims to have received from anyone, including experts, because we cannot trust Wikipedia editors to not fabricate the personal communications. If the information isn't published, we can't use it, and sending a personal communication to a WP editor does not constitute publication.
Personal communications a WP editor sends to him/herself are no different from the WP editor just making stuff up: not useable.
Personal communications between WP editors are not published and not useable
If a reliable source has made use of personal communications from a WP editor, it may still be cited in WP because it is a reliable source. Jc3s5h (talk) 05:01, 7 January 2011 (UTC)[reply]

Could we compromise on a footnote that says something like "E-mail messages or other types of personal communication sent to a Wikipedia editor are not published and therefore cannot be used to verify article content."? WhatamIdoing (talk) 05:04, 7 January 2011 (UTC)[reply]

I'd have no objection to that, though it seems pointless. Every single part of the policy is telling editors sources have to be published. I can't see a reason, 10 years into the existence of the project, to start listing all the kinds of unpublished sources we're not allowed to use. SlimVirgin ^{talk|contribs} 05:09, 7 January 2011 (UTC)[reply]

paraphrase

This relatively new sentence needs to be altered:

Be mindful of copyright: do not copy text from copyrighted sources, or paraphrase too closely, without in-text attribution."

It is wrong for two reasons. The first is that we can copy limited amount of text from a copyrighted source if it is a quote, and quotes can be attributed by an inline citation, they do not need in-text attribution. The second are sources that while under copyright can be copied or paraphrased: eg other Wikipedia pages and other appropriately licensed copyrighted sources under what is often described as copyleft without in-text attribution (although depending on the source they will need attribution in other ways); and other sources such as US government sources which can be copied or closely paraphrase which do not need in-text attribution (inline-citations will do). -- PBS (talk) 14:39, 1 January 2011 (UTC)[reply]

What wording do you suggest?—S Marshall T/C 15:33, 1 January 2011 (UTC)[reply]
How about: "Respect copyright [added: and avoid plagiarism]: do not copy verbatim or closely paraphrase text from a non-free source unless you are briefly quoting it in accordance with the non-free content policy. Use your own words to summarise sources." Fences&Windows 19:11, 1 January 2011 (UTC)[reply]

I know it is legal to copy or closely paraphrase text from a free source ... but I don't think we want to imply that doing so is an acceptable way to write an article. We want original writing (without original research). Blueboar (talk) 19:36, 1 January 2011 (UTC)[reply]

Blueboar I disagree with your position, copying text from encyclopaedic copyright expired sources such as EB1911 and DNB should not be discouraged as text from those sources in areas such as historic biographies of minor figures and descriptions of battles make for well written Wikipedia articles that can be modified and improved with new sources thorough the usual editing process, and as F&W has pointed out that boat has long since sailed so should not add wording that suggests that this is not allowed. --PBS (talk) 12:47, 2 January 2011 (UTC)[reply]

Ah, but quite a few articles are based on material taken verbatim from free sources. So we cannot add wording that suggests that this is not allowed. Fences&Windows 19:55, 1 January 2011 (UTC)[reply]

PBS, it's poor practice to copy from other sources, whether copyrighted or not, and we can't base one of our core content policies on the assumption that it's okay. SlimVirgin ^{talk|contribs} 21:26, 2 January 2011 (UTC)[reply]

How about: "Respect copyright and avoid plagiarism: do not copy verbatim or closely paraphrase text from a non-free source unless you are briefly quoting it in accordance with the non-free content policy. Summarise sources in your own words." [Addition is the wikilinking in "closely paraphrase".] I personally support Blueboar's addition but I think it belongs in a footnote rather than the main body of the policy so as to minimise policy-bloat.—S Marshall T/C 20:00, 1 January 2011 (UTC)[reply]

I think that's very close, but getting long. How about: "Respect copyright and avoid plagiarism: do not copy verbatim or closely paraphrase text from a non-free source unless you are briefly quoting. Summarise sources in your own words." (removed link to non-free content policy) Ocaasi (talk) 05:35, 2 January 2011 (UTC)[reply]

Free had two obvious meanings (free as in freedom and free as in really really cheap). Also for example Wikipedia text is not copyright free it is copyleft, so "a non-free source" does not cover it. It also does not cover the copyright status of work by the U.S. government. How about " ... text from a source not compatible with the Text of Creative Commons Attribution-ShareAlike 3.0 Unported License as used by Wikipedia unless you are ...". (The licence is linked at the bottom of every Wikiepdia page). But this does not cover SV's wish to include in-line attribution (which I assume was the reason for the current wording). -- PBS (talk) 12:47, 2 January 2011 (UTC)[reply]

But is a public domain source "compatible" with CC A-SA? Your phrasing doesn't seem to allow when the source does not have a license which requires attribution nor ShareAlike. -- SEWilco (talk) 21:14, 2 January 2011 (UTC)[reply]

Given that we have in the lead "The other three are No original research, Do not violate copyright and Neutral point of view." Do we need this second sentence in this policy on verification? -- PBS (talk) 10:38, 7 January 2011 (UTC)[reply]

In-text attribution when quoting

This page isn't about copyright, and the discussion above is getting a bit bogged down. If we're going to mention it, it should be very brief with a link to the relevant pages. The copyright issue apart, we should also make clear that quotations need some form of intext attribution so that people know who's being quoted. SlimVirgin ^{talk|contribs} 21:24, 2 January 2011 (UTC)[reply]

Although may be desirable for stylistic reasons, quotations don't need in-text attribution, they do need an in-line citation. As this is a content policy page about verifiability and not a manual of style page, there is no need to emphasise that point. -- PBS (talk) 21:40, 2 January 2011 (UTC)[reply]

It's not an MoS issue, and they do need attribution so that people know who is talking. SlimVirgin ^{talk|contribs} 21:42, 2 January 2011 (UTC)[reply]

SlimVirgin, you have made two substantially different statements, "quotations need some form of intext attribution" and "they do need attribution so that people know who is talking". A footnote may, depending on placement, satisfy your second statement but not your first. Also, we still have the liberty to import articles wholesale from public domain or compatibly licensed sources, and to attribute them with a general statement near the end of the article. Jc3s5h (talk) 01:19, 3 January 2011 (UTC)[reply]

Sorry, Jc, I don't follow. I don't see a difference between my first and second points. If you don't attribute in text, you "end up with" this kind of "random quoting" for no obvious "reason," and without "making clear" who is being "quoted." SlimVirgin ^{talk|contribs} 03:52, 3 January 2011 (UTC)[reply]

Just because a technique could be confusing does not mean it is always confusing. If there is only one quotation in a paragraph, and that quote is immediately followed by a footnote, there will be no confusion as to the source of the quote. Do not prohibit a writing technique in the policy just because it could be abused. Jc3s5h (talk) 04:15, 3 January 2011 (UTC)[reply]

It's not a question of abuse, but of poor writing. The reader will have to click on the footnote for the source, then click on the source to see if it's the source who's quoting someone else. And if it's not online they won't be able to find out that way. Sometimes it's not even clear whether someone is being quoted, or whether they're just scare quotes. The bottom line is that quotes need in-text attribution or they look silly. SlimVirgin ^{talk|contribs} 05:03, 3 January 2011 (UTC)[reply]

I'd like to see some situations, Jc3s5h, where it wasn't optimal to attribute inside the quote. I agree with your idea, that a technique shouldn't be prohibited just because it could be abused, but it's not apparent to me that there's every a reason to do what you're suggesting. In the case of a wholesale importation from another source, I assume the relevant quotation would not be the entire pasted text, but only the "quotations" therein, which would already have in-line attribution. In other words, we don't quote Britannica 1911, but we would reprint their inline attribution of others. What other situations are there? Ocaasi (talk) 08:35, 3 January 2011 (UTC)[reply]

Ocassi, policies should not demand "optimal" writing, they should just forbid writing that causes errors, confusion, or legal violations. For an example of a quote that does not name the speaker inline, just in a footnote, see the first quote in the lead of Second. I find that quote acceptable as is, although identifying the speaker wouldn't do any harm. In this case, there is an argument for leaving the name of the speaker vague, because although the speaker is technically the General Conference on Weights and Measures, virtually every government that has considered the matter has adopted the definition as a matter of law for most purposes. Not naming the speaker reflects the true situation: it is a pervasive definition that virtually everyone uses for most purposes. Jc3s5h (talk) 14:21, 3 January 2011 (UTC)[reply]

SV we have to give an in-line citation after a quote, and it is possible to have a perfectly valid quote that does not need in text attribution to be verifiable. I came across one yesterday in the article on the Siege of Drogheda

However, Cromwell lacked the technical training to systematically construct siege entrenchments and bombard a fortified place into surrender. "The Cromwellian siegework repertoire included only the first and/or last stages; that is assault, or failing that, blockade".^[8]

The quote is fully cited with a short citation ^{"Padraig Lenihan, Confederate Catholics at War, p175"} and a full source (isbn etc) listed in the References section.

"Be mindful of copyright: do not copy text from copyrighted sources" is too simple an explanation as I explained above because there are exceptions to that rule, for example limited copying placed in quotes and text from copyleft sources with compatible licences (such as other Wikipedia pages and Citizendium). -- PBS (talk) 14:45, 3 January 2011 (UTC)[reply]

That's not an example of a valid quote, Philip; in fact it's exactly what should be avoided. It's inappropriate in part because we have no idea who's being quoted, and in part because there's no hint as to why there's a quote there. Someone has simply chosen to place quotation marks around a sentence that was written by someone else, apparently at random. SlimVirgin ^{talk|contribs} 19:54, 3 January 2011 (UTC)[reply]

There are times when in-text attribution is needed for reasons of NPOV, but not for verifiability. SV you may not like the style, but the in-line citation after the quote satisfies verifiability in a way that an in-text attribution usually does not, or are you saying that the footnote does not give adequate information? An in-line citation is something that has been demanded for a number of years, this new requirement -- for in-text attribution -- does not improve verifiability and as Jc3s5h points out above is not always good style.

Further the current sentence requesting in-text attribution as part of verifiability causes problems with incorporating text from copyright compatible sources such as Wikipedia and Citizendium. As you have pointed out above trying to incorporate all of this is complicated, but it would be better to remove the current sentence without a replacement than leave it as it is. -- PBS (talk) 09:51, 4 January 2011 (UTC)[reply]

Since in-text attribution is a thorny issue, it's worth repeating that this is not about requiring it anywhere except for quotations. The notion of a quotation is inherently tied to its unique author. The above example seems like a particularly inapt edge case, because it refers to a commonly used scientific standard where there's little need for quotation or in-line attribution, since most everybody uses the same standard. Either, it could be prefaced by, 'Several international bodies use the definition...' or it's just not a unique quotation--more a definition, which is not much about its author, just about common terminology. Ocaasi (talk) 10:52, 4 January 2011 (UTC)[reply]

I disagree with Ocassi that "commonly used scientific standard where there's little need for quotation" first because the definition has been carefully crafted to state the exact conditions under which the measurement should be carried out; paraphrasing introduces a serious risk of a change in meaning. Second, the particular passage is adopted as binding law in many English speaking countries, and anyone who follows court cases understands how even a single word in a law has the potential to be at the center of a legal dispute. Thirdly, articles are plagued by drive-by editors and vandals who either overestimate their ability to rephrase important definitions, or deliberately introduce subtle vandalism. A quotation allows editors who are not specialists in the subject matter, but who are cleaning up after the troublesome editor, to just look at the source to repair the damage. Jc3s5h (talk) 14:15, 4 January 2011 (UTC)[reply]

Yes, I would expect we use the verbatim definition, too. I just think that's a bad example of a quotation, since the author is incidental to the accepted standard, which is shared by basically all serious scientific bodies. If it weren't a common standard, we would use in-text attribution, but since it is a common standard, it's just a scientific definition. So it's fine without in-text attribution, or it could easily be defined 'By the scientific community' or 'By major scientific bodies'. In any event, the author is incidental, so we're just citing any relevant source for the definition though many different sources could support it. Maybe that's not persuasive, but I don't think this case is a clear counterexample. It's also in the passive voice, 'has been defined to be', but we don't say by whom.69.142.154.10 (talk) 15:28, 4 January 2011 (UTC) Sorry, that was me...Ocaasi 16:41, 4 January 2011 (UTC)[reply]

This page is a policy, not a style guide. A statement that in-text attribution is usually better style, easier to read, etc., belongs in a style guide. A statement that quotes must be attributed in-line should not appear in a policy if any significant number of counter-examples can be found. Jc3s5h (talk) 15:35, 4 January 2011 (UTC)[reply]

Philip (or anyone else), can you give an example of a professional writer in any other publication randomly quoting people without making clear who said it, or why they're quoting that person? SlimVirgin ^{talk|contribs} 00:45, 6 January 2011 (UTC)[reply]

No, not at all. It would be unprofessional. My mantra: "When in doubt, atribute." -- Brangifer (talk) 03:19, 6 January 2011 (UTC)[reply]

SV, it was mentioned above ("Although may be desirable for stylistic reasons..." and "SlimVirgin, you have made two substantially different statements ...", you are asking a style question not a verification question. The in-line citation is all that is needed for attribution, in-text attribution is not sufficient. In-text attribution is only needed if one is closely paraphrasing (without quoting) a source to make it clear that the words are not those of a Wikipedia editor. Personally I would prefer that those words were quoted, but as you pointed out in a different section (on the issue of plagiarism) that should not be made compulsory. However for the reasons given above, by a number of editors, the current wording I quoted at the start of this section, is not suitable for this policy as it is both inappropriate for quotes and it is clearly wrong for text included from sources that are copyleft compatible. -- PBS (talk) 09:55, 6 January 2011 (UTC)[reply]

I've only ever seen this—the sudden appearance of quotation marks with no explanation or attribution—in three places: on boards outside grocers' shops (Lovely carrots "nice and fresh", 40p per lb); in letters from one of my aging relatives, who adds quotation marks at random, and given that she's been doing it all her life I don't have the heart at this late stage to ask her why; and on Wikipedia.

It's not just a style issue, because it raises verifiability issues. If I write this with an inline citation ...

Police are "combing the area" for forensic evidence where the body was discovered.^[1]

^ Smith, John. "Body discovered," The Times, January 6, 2011.

... the reader can't tell whether I'm quoting Smith or someone Smith has quoted, or whether I've added quotation marks as scare quotes, or—like my relative—for reasons no one can fathom. SlimVirgin ^{talk|contribs} 00:08, 7 January 2011 (UTC)[reply]

SlimVirgin, can you have this discussion elsewhere or at some other time? We were trying to work out wording for a note about respecting copyright before you hijacked this thread with irrelevances about in-line attribution. Fences&Windows 03:18, 7 January 2011 (UTC)[reply]

I've created a new sub-thread, which is now only about in-text attribution. SlimVirgin ^{talk|contribs} 05:00, 7 January 2011 (UTC)[reply]

It is not a question about verifiability because the in-text attribution does not allow a reader to verify anything, it is the citation that is needed to verify that the quote is correct. Whether in-text attribution is needed is a style question. For example suppose the footnote you gave said (Smith, John. "Body discovered," The Times, January 6, 2011, quoting a police statement issued by ... .) then it is fully attributed but not in-text. -- PBS (talk) 10:50, 7 January 2011 (UTC)[reply]

Philip, it's silly to want poor writing in the text, but then ask people to add attribution to the footnote. Better just to have the attribution where it belongs. SlimVirgin ^{talk|contribs} 23:12, 7 January 2011 (UTC)[reply]

The current wording is clearly wrong for text included from sources that are copyleft compatible with Wikipedia. -- PBS (talk) 10:50, 7 January 2011 (UTC)[reply]

I'm not sure that's true. If they're copyleft compatible, then they don't need quotation for copyright reasons. But that doesn't mean that we can "quote" them without in-line attribution. It's ok to plagiarize from a copyleft source, but if we are going to "quote" it, then we still need to say who the author of the quote is. Ocaasi (talk) 16:14, 7 January 2011 (UTC)[reply]

Ocaasi the current wording is "do not copy text from copyrighted sources, ..., without in-text attribution." it is not about quotes it is about copying text, and it states at text copied from any copyright source, (including text copied from other Wikipdia pages) has to have in-text attribution. -- PBS (talk) 15:27, 8 January 2011 (UTC)[reply]

Our job as editors is to make sure the reader can access accurate information, and as easily as is possible. With out inline attribution on quoted content confusion is possible as some of the comments above suggest. In both my fields quoted sections with out attribution would not be considered acceptable nor would I accept that kind of writing from a student. This becomes even more critical on articles where there are many sources and possibly quotes that can be attributed to multiple speakers. Why not make it clear and easy for the reader? Why not follow a standard that has been used for a long time and is useful in academia? We don't have to reinvent the wheel. If something works, and is useful to us, why not use it.(olive (talk) 19:05, 7 January 2011 (UTC))[reply]

Exactly, and it's not just in academia. Professional writers don't add quotation marks without making clear in some way who is being quoted. It's puzzling to see anyone argue that our content policies should encourage poor writing. SlimVirgin ^{talk|contribs} 23:16, 7 January 2011 (UTC)[reply]

It can be made clear thorough an in-line citation (which has long been mandated by this policy), it does not have to be done with an in-text attribution. -- PBS (talk) 15:30, 8 January 2011 (UTC)[reply]

Why do you think it's okay for Wikipedia to encourage unprofessional writing of that kind? And if you're going to argue that it's not unprofessional, you'll have to show us some examples of professional writers doing it. SlimVirgin ^{talk|contribs} 15:39, 8 January 2011 (UTC)[reply]

To leave out mention of in-text does not affect verification, and this is verification policy not a style guide (so the issue of whether it is "unprofessional writing" is IMHO not pertinent to verification) . As I have written repeatedly, if it is to be included then it has to be qualified for the way we handle incorporation of copyleft text and other specific sources such as those from US government, and that is a complication that does not need to be in this policy. -- PBS (talk) 09:35, 9 January 2011 (UTC)[reply]

Dataspaces and OR

Recent policy announcements by NIH and NSF require funded research workers to put results in sustainable dataspaces. See [1], [2]. (This is not mentioned in Dataspaces.

Will WP consider information in the DataSpace of a major research university self-publication or publication by the university?

Wikipedia reliability

Sorry to rake up "reliability of master's dissertations" again, but I am going through guidelines that affect how I contribute to WP, and the issue concerns me from another angle. This comment is meant to be self contained.

In Wikipedia:Identifying reliable sources, Section 2.1, (titled "Scholarship"), the final sentence of the 3rd bulleted item states: "Masters dissertations and theses are only considered reliable if they can be shown to have had significant scholarly influence."

Should the word "reliable" (together with its morphological variants) in this, and equivalent contexts, be replaced by "Wikipedia reliable" or "WK:reliable" or some other denotation that shows its specialized usage?

Readers with experience of supervising and examining master's dissertations, and presenting these in discussions of priority, could consider the statement denigratory to their professional integrity.

Also, the usage just quoted conflicts with the definitions of "reliable" in the Oxford English Dictionary and the Random House Dictionary of the English Language, and with the usage of the word in the approximately 25 places where it occurs in Reliability of Wikipedia.

I just ran some searches relating to master's dissertations.

The web site for the MIT Master of Science in Management Studies [3] mentions four exemplary master's dissertations with email addresses of faculty advisors. It seems that the Sloane School is proud of its master's dissertations.

A Google Scholar search on "Masters dissertation" brought up 222,000 hits. I have looked at a few. Some relate to unpublished dissertations which have been cited in scholarly journals. Other hits relate to studies of the use of master's dissertations as sources of information that are not available elsewhere.

The National Library of Medicine Manual of Style devotes an entire chapter to the style of citation to be used for master's and doctoral dissertations. Presumably this shows the perception of a need to cite master's dissertations by the U.S. agency responsible for the dissemination of biomedical information.

The claim that master's degrees are seldom terminal for people conducting serious research ignores accepted professional career paths of many people who conduct serious research in M.S., M.Sc., M.A, M.B.A, M.P.H, M.F.A. and many other master's degree programs.

The suggestion that reliability is conferred by significant scholarly impact raises difficulties, without a definition of "significant". Also, there have been many occurrences in research when the significance of a paper in a peer reviewed journal was not recognized for several years. And the scholarly impact of most peer reviewed articles, measured by the number of citations in later articles, is zero (this can be seen in Web of Science and Scopus searches).

The fact that a dissertation has been cited shows its perceived interest content, not its reliability (the citing paper might even consider the dissertation erroneous).

Hope I have not repeated myself too much. But I am concerned about outside scepticism of WP, and the wording I have questioned does seem vulnerable.

Michael P. Barnett (talk) 22:40, 9 January 2011 (UTC)[reply]

[1] Smith, John. "Body discovered," The Times, January 6, 2011.

[1]