[go: up one dir, main page]

Wikidata:Property proposal/Place


Property proposal: Generic Authority control Person Organization
Creative work Place Sports Sister projects
Transportation Natural science Computing Lexeme

See also

edit

This page is for the proposal of new properties.

Before proposing a property

  1. Search if the property already exists.
  2. Search if the property has already been proposed.
  3. Check if you can give a similar label and definition as an existing Wikipedia infobox parameter, or if it can be matched to an infobox, to or from which data can be transferred automatically.
  4. Select the right datatype for the property.
  5. Read Wikidata:Creating a property proposal for guidelines you should follow when proposing new property.
  6. Start writing the documentation based on the preload form below by editing the two templates at the top of the page to add proposal details.

Creating the property

  1. Once consensus is reached, change status=ready on the template, to attract the attention of a property creator.
  2. Creation can be done 1 week after the creation of the proposal, by a property creator or an administrator.
  3. See property creation policy.


Filial church

edit

Park

edit

Hierarchy of administrative territorial entities

edit

Street

edit

Body of water

edit

Geographic location

edit

Yandex Maps API place ID

edit
   Ready Create
Descriptionunique object identifier on Yandex maps
Data typeExternal identifier
Domaingeographical feature (Q618123)
Allowed valuesnumber
Example 1Park named after Tsiolkovsky (Q130536892)31296578132
Example 2Red Square (Q41116)1520633025
Example 3Peterhof Museal Reserve (Q4360238)1520340105
Example 4St. Nicholas' Church (Q613788)244551532294
Example 5Gamla stan metro station (Q1208392)134085274659
Formatter URLhttps://yandex.ru/maps/org/$1
See also2GIS place-ID (P12487)
Single-value constraintyes

Motivation

edit

For geographic objects, it is useful to add the Yandex.Maps identifier, one of the most popular mapping services in the post-Soviet space.  – The preceding unsigned comment was added by Afanasovich (talk • contribs) at 10:47, 16 October 2024 (UTC).[reply]

Discussion

edit

Songkick area ID

edit
Descriptionidentifier for an area, on Songkick
Data typeExternal identifier
Allowed values([1-9]\d*)
Example 1Dresden (Q1731)28467
Example 2Birmingham (Q2256)24542
Example 3Salem (Q43919)10501
Sourcehttps://www.songkick.com/
Formatter URLhttps://www.songkick.com/metro-areas/$1
Single-value constraintyes
Distinct-values constraintyes

Motivation

edit

Preparation in view of repeated data donations by Songkick, relevant information on areas. --Paspal (talk) 20:29, 15 October 2024 (UTC)[reply]

Discussion

edit

@Paspal, Yuriklim, Schmidt Fu, Dcflyer: ✓  Done: Songkick area ID (P13080). --Lewis Hulbert (talk) 14:37, 24 October 2024 (UTC)[reply]

‎Google Plus code

edit
   Under discussion
DescriptionIdentifier for a location as seen on Google Maps
RepresentsOpen Location Code (Q23582265)
Data typeString
Example 1Eiffel Tower (Q243)V75V+8Q Paris, France
Example 2Empire State Building (Q9188)P2X7+9P New York
Example 3Alley Theatre (Q3612530)QJ6M+RV Theater District, Houston, TX
Sourcehttps://maps.google.com/pluscodes/
Robot and gadget jobsimplementing

Motivation

edit

I believe this could be used widely on the Wikidata, as Plus codes are open-source and free to use, which aligns with Wikidata's free and collaborative foundation. Though I am not sure how to create and implement properties myself, I believe this could be a valuable addition for places.  – The preceding unsigned comment was added by The Sharpest Lives (talk • contribs) at 16:15, 15 October 2024 (UTC).[reply]

Discussion

edit

Finlandssvenska bebyggelsenamn ID

edit
   Under discussion
Descriptionidentifier for a Swedish toponym in Finland
Data typeExternal identifier
Domainitem, geographic entity (Q27096213)
Example 1Åland (Q5689)1
Example 2Kupittaa (Q2365158)924
Example 3Lappeenranta (Q181854)2800
Sourcehttps://bebyggelsenamn.sls.fi/
External linksUse in sister projects: [ar][de][en][es][fr][he][it][ja][ko][nl][pl][pt][ru][sv][vi][zh][commons][species][wd][en.wikt][fr.wikt].
Planned useMoving the data from described by source (P1343) to an external ID on 1268 items. Maybe Mix'n'match for the rest.
Number of IDs in sourceover 2800
Expected completenesseventually complete (Q21873974)
Formatter URLhttps://bebyggelsenamn.sls.fi/bebyggelsenamn/$1/-/
Applicable "stated in"-valueFinlandssvenska bebyggelsenamn (Q116779391)
Single-value constraintyes
Distinct-values constraintyes

Motivation

edit

Finlandssvenska bebyggelsenamn is a dictionary for Swedish toponyms in Finland. User Robertsilen has linked hundreds of the entries with described by source (P1343), and I think this deserves its own external ID. Samoasambia 07:59, 17 October 2024 (UTC)[reply]

Discussion

edit

  Notified participants of WikiProject Finland. Samoasambia 08:01, 17 October 2024 (UTC)[reply]

GERS ID

edit
   Under discussion
DescriptionGlobal Entity Reference System (GERS) identifier, created and maintained by the Overture Maps Foundation
RepresentsGERS ID (Q121705182)
Data typeExternal identifier
Domaingeographical feature (Q618123)
Allowed values[a-z0-9]{32}
Example 1Reichstag (Q151897)08b1f1d488708fff0200bf6d40c1cf8f
Example 2Brandenburg Gate (Q82425)08b1f1d488601fff02002aee6ba961a4
Example 3Berlin (Q64)0851d7537fffffff01f0a6a39902a5ce
Sourcehttps://docs.overturemaps.org/gers/, https://explore.overturemaps.org/
Distinct-values constraintyes

Motivation

edit

The GERS ID (Q121705182) is intended to be a stable identifier to associate external data to an Overture Maps Data (Q121493528) element. The identifier has been developed and is maintained by the Data Working Group (OMF) (Q121493561), a working group of the Overture Maps Foundation (Q115725620). --Robot8A (talk) 14:00, 17 October 2024 (UTC)[reply]

Discussion

edit

Neighborhood

edit

Architectural structure

edit

Outer space

edit
Please visit Wikidata:WikiProject Space for more information. To notify participants use {{Ping project|Space}}

Classification of human settlements

edit

‎male mean age

edit
   Under discussion
Descriptionmale mean age in a given place; qualifier of mean age (P4442)
Data typeNumber (not available yet)
Domaincountries, municipalities, settlements, ...
Allowed values0-150
Allowed unitsyear (Q577)
Example 1Prague (Q1085)mean age (P4442)=42.0 → male mean age = 38
Example 2Brno (Q14960)mean age (P4442)=41.8 → male mean age = 36.8
Example 3Vysočina Region (Q190930)mean age (P4442)=46 → male mean age = 40.1
Sourcevarious databases, eg. Czech Statistical Bureau
Planned useimport values for Czech municipalities

Motivation

edit

There are qualifiers female population (P1539) and male population (P1540) for population (P1082), but there are no equivalent qualifiers for mean age (P4442). The data for the qualifiers are available. Standazx (talk) 07:39, 7 September 2024 (UTC)[reply]

Discussion

edit

fe‎male mean age

edit
   Under discussion
Descriptionfemale mean age in a given place; qualifier of mean age (P4442)
Data typeNumber (not available yet)
Domaincountries, municipalities, settlements, ...
Allowed values0-150
Allowed unitsyear (Q577)
Example 1Prague (Q1085)mean age (P4442)=42.0 → female mean age = 45
Example 2Brno (Q14960)mean age (P4442)=41.8 → female mean age = 42.7
Example 3Vysočina Region (Q190930)mean age (P4442)=46 → female mean age = 51.6
Sourcevarious databases, eg. Czech Statistical Bureau
Planned useimport values for Czech municipalities

Motivation

edit

There are qualifiers female population (P1539) and male population (P1540) for population (P1082), but there are no equivalent qualifiers for mean age (P4442). The data for the qualifiers are available. Standazx (talk) 07:40, 7 September 2024 (UTC)[reply]

Discussion

edit

Other

edit

‎most populous settlement

edit
   Under discussion
Descriptioncity, town, or other settlement with the largest population in this area (country, state, county, continent, etc.)
Representslargest city (Q51929311)
Data typeItem
Domaingeographic region (Q82794)
Allowed valuesinstances of human settlement (Q486972)
Example 1United Kingdom (Q145)most populous settlementLondon (Q84)
Example 2Maryland (Q1391)most populous settlementBaltimore (Q5092)
Example 3South America (Q18)most populous settlementSão Paulo (Q174)
Planned usereplacement of statements modeling the inverse relation with instance of (P31)/of (P642)
See alsocapital (P36)

Motivation

edit

The most populous settlement in a territory is a very commonly cited piece of information that is not well modeled at present. With no specific property, the information can only be expressed by using a qualifier, e.g. London (Q84)instance of (P31)largest city (Q51929311)of (P642)United Kingdom (Q145), or United Kingdom (Q145)has characteristic (P1552)largest city (Q51929311)applies to part (P518)London (Q84). In each case, it is not clear what the most appropriate qualifier is: in the former statement, for example, the overloaded of (P642) could instead be any of country (P17), part of (P361), relative to (P2210), or characteristic of (P13044).

Accepting the need for a specific property, the only question is the direction of the relation, and I think the answer is clear: as with capital (P36), this information is best modeled by linking the parent territory to the settlement, rather than the other way around, since this enables use of a single-best-value constraint (Q52060874), where there are multiple values corresponding to different dates or measurement methods.

A similar proposal was rejected 11 years ago, on the basis of 1) the vagueness of the word "largest" (which the present proposal does away with), and 2) the idea that the most populous settlement of a territory should be inferable from population (P1082) statements, and thus doesn't warrant a property. However, that naively assumes complete information and consistent measurement methodology, and does not allow for any qualification or referencing regarding the "most populous" determination.  – The preceding unsigned comment was added by Swpb (talk • contribs) at 15:41, 7 October 2024 (UTC).[reply]

Discussion

edit

  Notified participants of WikiProject Cities and Towns

  •   Support This is useful for infoboxes and makes it possible to keep track of which city is the largest over time. Dexxor (talk) 15:58, 7 October 2024 (UTC)[reply]
  •   Support, good idea! Pallor (talk) 21:35, 7 October 2024 (UTC)[reply]
  •   Support while this information could be derived from a query, with WDQS struggling, it seem better to provide a short cut. Vicarage (talk) 19:31, 14 October 2024 (UTC)[reply]
  •   Conditional support Only when a Wikipedia infobox must use this property (please name the template and wiki). Otherwise why not use WDQS? Midleading (talk) 03:56, 15 October 2024 (UTC)[reply]
    You can't have a shortcut designed to reduce WDQS stresses and then clutter it with use qualifiers that burden the system more. Vicarage (talk) 05:18, 15 October 2024 (UTC)[reply]
    There are only 94 items like London (Q84)instance of (P31)largest city (Q51929311)of (P642)United Kingdom (Q145). I can't see how these 94 items clutter WDQS compared to 100,000,000+ other items. And also only 2 of them have references, so this property could be 80% unreferenced if created. Also noting that the value can be inferred from population (P1082) statements via WDQS, which are more than 95% referenced and have more than 3 million uses. So what I see is just contradictory to what is claimed: there is complete information at population (P1082), and about 95% have references, but this proposed property can't provide comparable quantity and quality at all, can't even provide coverage on every country in the world (unless they are filled by a bot which inferred the information from WDQS). I would just suggest removing these instance of (P31) statements altogether, if no wiki and nobody is using them. Midleading (talk) 07:55, 15 October 2024 (UTC)[reply]
    Of course it could be derived by a WDQS for all settlements in an area sorted by population, limit 1. And indeed it should be populated that way, without needing references, to save everyone running such an expensive query again. of (P642) is being removed of course. WD has many shortcuts to data that could be derived other ways, this seems a sensible addition Vicarage (talk) 11:13, 15 October 2024 (UTC)[reply]
    I prefer to running a query every time. It ensures the users get the latest data rather than unmaintained, outdated and incomplete data added by P642 users. Why trust a bot running the same WDQS query only once a year when I can do it myself? Midleading (talk) 05:44, 19 October 2024 (UTC)[reply]
    Speed and ease of use. Its not as if settlements are jostling for position, and the vagaries of boundaries means its meaningless to say Leeds overtakes Bradford. You are very keen on the mul project to improve performance, so it seems odd to not be concerned about it here. I guess the real arbiters are the Wikipedian infobox people who will decide whether a slower WD derived answer is better than a static hard-coded one. And offhand, I'd struggle to write a query to do all the selecting, counting and limiting, when using the property would be trivial. Vicarage (talk) 06:31, 19 October 2024 (UTC)[reply]
    Wikipedia can't use WDQS at all. Data must be directly or indirectly linked from the item of the article subject, inverse links are not supported. So it is acceptable to create a property for use on Wikipedia even if WDQS can also be used. But in this case, the proposed property is hard to use on Wikipedia, because Wikipedia article of United Kingdom (Q145) can't find the inverse link to London (Q84), and London (Q84) doesn't need to link to United Kingdom (Q145) in infobox in this way either. Another problem is that Wikipedians want to keep data on the wiki page rather than Wikidata for various reasons. A lot of articles are still written as if COVID-19 never happened and Russia never started a war in Ukraine, and Wikipedians think it's better that way. So, if no Wikipedia really wants to use it, then it's not necessary to create the property that nobody uses, and that requires WDQS again for finding the inverse link. Midleading (talk) 17:36, 19 October 2024 (UTC)[reply]
  •   Oppose the most rejections last time was that this is queryable, and that is still the case. In essence, this is redundant information. The argument that about the data has not been added to Wikidata is just as applicable to this property, and instead of adding this statement, the energy could be spent on adding a good population statement. Ainali (talk) 12:08, 15 October 2024 (UTC)[reply]
  • In practice, I would want this property to be usually used with start time (P580) and have that as a required qualifier. That information is a bit more complex to get with WDQS and thus more useful to actually store.
In general, the usage in Wikipedia template does matter. Especially, for smaller Wiki's it might be a way to get them useful data from Wikidata. ChristianKl12:14, 15 October 2024 (UTC)[reply]
  • Thinking about more,   Oppose in the present version. London (Q84) is not the most populous settlement in the UK but Greater London (Q23306) is. Any properties where the examples someone comes up when proposing it are wrong, is likely to be misunderstood in practice. urban agglomeration (Q159313) subclasses urban area (Q702492), which subclasses human settlement (Q486972). The term settlement includes entities that aren't cities on also entities like Rhine-Ruhr Metropolitan Region (Q164903). I don't think our iterms for urban agglomeration (Q159313) are reliable enough currently that simply running a query sorted by population with limit 1 would do the job. ChristianKl12:27, 15 October 2024 (UTC)[reply]
    Yeah, WDQS doesn't always work in such a complex world (for example, why London is a settlement, not Greater London, not Great Britain?). The only reliable way is to find a good source as reference, and this way it can also be used on Wikipedias. But I doubt anyone is going to do that after this property is created. Midleading (talk) 17:22, 15 October 2024 (UTC)[reply]
    A more reliable way would be to create a "most populous city" or "most populous urban area" property instead of one about settlements. ChristianKl22:12, 15 October 2024 (UTC)[reply]
    Greater London is not a settlement, its an urban_agglomeration, a type of region, which would not be valid for this proposed property. And most_populous_city won't work for regions nested in regions, as they all are. Vicarage (talk) 23:13, 15 October 2024 (UTC)[reply]
    By the same logic, London wouldn't work because it's a city and not a settlement. In this context I would take settlement to mean instance of (P31) of an subclass of human settlement (Q486972). If you try to identify settlements by looking at the earth with a satellite image, you are seeing an urban area (or urban_agglomeration) and not seeing a city. A city is an entity with boundaries that are set by law.
    The term settlement is the superclass that covers both entities that are have their boundaries defined by administratively and entities that you would identify by looking at a satellite image.
    If you go to https://en.wikipedia.org/wiki/List_of_largest_cities you find a list where the item we have on Wikidata for a city usually corresponds to the "city proper" column. You can see that, by the fact that the item for that city is going to have the population in that column. On Wikidata, we have separate items for the urban area (and in cases where you have a single city with surroundings that's an urban agglomeration). For good fun there's also metropolitan area (Q1907114). ChristianKl17:42, 16 October 2024 (UTC)[reply]
    Good point that needs to be addressed. Perhaps we can solve the ambiguity of the term "city" by enforcing the use of qualifiers like criterion used (P1013)city proper (Q5124027) and criterion used (P1013)metropolitan area (Q1907114) if it makes a difference. Dexxor (talk) 06:56, 17 October 2024 (UTC)[reply]
    As far as I know, within Wikidata we don't have ambiguity about how the term city is used. It means an administrative entity. It's boundaries are the boundaries of the administrative entity and it's population is the number of people who live within the boundaries of that administrative entity. We then have extra items for the urban area (Q702492) and metropolitan area (Q1907114) and someone who wants to know the population (P1082) can look at those items.
    I tried to run a few queries and it's interesting that queries often timeout. I however found Maryland (Q1391)most populous settlementWashington metropolitan area (Q2367175). ChristianKl12:23, 20 October 2024 (UTC)[reply]
  •   Support Valuable information, useful to make readily accessible. I would really like to see this for eg all civil parishes in the UK, to identify the most significant population centre in the parish. The objections above that "this information is queryable" don't cut it for me, for at least two four reasons:
    (i) in many environments where this would be useful, eg the infobox of a Commons category, queryability is not available;
    (ii) there are queries where one might want to return this as convenience information for a large large number of rows: if that query was to have to compute the largest population centres, as opposed to just looking it up, that would not only requiring writing a query that is appreciably non-trivial, it is also a query that may not complete;
    (iii) the information can only be obtained by query if we have complete population information to query over. But as eg query https://w.wiki/Bczb for UK administrative areas reveals, this is not so. Bardsey Island (Q1991483) (population: 4) is not the most populated place in the civil parish Aberdaron (Q2556821) -- the most populated place is the village of Aberdaron itself, but unlike the island we don't have a population statement for the village. Worthwhile, therefore, if we can import this information from an appropriate reliable source (eg the full UK census) that does have complete coverage, or otherwise from a source that we can depend on;
    (iv) historic information is also of value -- what used to be the most significant population centre (but has since been surpassed) -- with a property, and an 'end time' qualifier, we can record this.
IMO, if well-sourced, the information is likely to be pretty long-term stable, so the data en:denormalization involved seems to me acceptable. -- Jheald (talk) 17:06, 21 October 2024 (UTC)[reply]
I was already thinking of writing about other aspects, but this post came in handy. The negative comments are on a rather narrow range of issues, whereas the proposed feature has a much wider range of uses. It can be any administrative unit in the world, however small and insignificant, within which the seat and the most populous municipality settlement may differ. It can also be used to record historical data for administrative units where the most populous municipality settlement has changed over the ages. So I can see this feature being useful in many more places than London. Pallor (talk) 17:23, 21 October 2024 (UTC)[reply]
@Pallor the proposed property is not named "most populous municipality" and frequently the most populous settlement is not a municipality. If you actually want a property to store the "most populous municipality" creating a property called "most populous municipality" would be more better than the proposed approach.
If one person expects the property to contain the "most populous municipality" and another expects it to actually contain the "most populous settlement", uses of infoboxes are going to be unhappy. I don't understand why you would want that mess, if we could also create a "most populous municipality" property where most users likely have a good intuition of what values it will contain. ChristianKl09:56, 23 October 2024 (UTC)[reply]
It was a bad translation that I didn't notice. Thanks for telling me, I fixed it. Pallor (talk) 10:26, 23 October 2024 (UTC)[reply]
If you prefer using "most populous settlement" over "most populous municipality", why do you dislike having the property as ""most populous municipality"? ChristianKl12:04, 23 October 2024 (UTC)[reply]