GRIMS Tables

From ICISWiki

Jump to: navigation, search

The following tables and fields and levels will be implemented unless anyone objects. Changes to tables are in italics


The following tables and fields and levels will be implemented unless anyone objects. Changes to tables are in italics

The following tables and fields and levels will be implemented unless anyone objects. Changes to tables are in italics

Contents

Changes to tables

LOCATION - change in Central

One change - adding a field NNPID for the LOCID of the nearest named place (as agreed in 2008). This will typically be used for a location where LNAME is a text description of the locality, and NNPID points to the village closest to that locality

Usage for IRGCIS collecting locations: LNAME of a location will contain the smallest scale name or description available from IRGCIS; in order: country, province, district, town, village, locality description. Validated interpreted higher level fields respectively go in CTRYID, SNL1ID, SNL2ID, SNL3ID, NNPID. Original unvalidated data on country, province, district, town, village will go into LOCDES exactly as provided in the format "Field=value". Coordinate data go into GEOREF.

Columns - Long Name (Name) Description Type Length (bytes)
LOCATION_ID (LOCID) The unique identifier of the location. Long 4
LOCATION_TYPE (LTYPE) Specifies the type Location E.g., Country, Province or Nursery test site. Links to types defined in the UDFLDS table Integer 2
NO_LATLONG (NLLP) Number of LAT-LONG points in the polygon describing the Location. Zero if no LAT-LONG Integer 2
PREF_LOC_NAME (LNAME) Preferred name of the location

This may be a standardised name, such as from a digital gazetteer, or a coded name such as BUR010, or a text description, such as “2km E of Los Baños”. Note that there is no requirement to assign a name to an unnamed location; an unnamed location may instead be identified by its latitude-longitude and/or altitude coordinates (in table GEOREF) and/or the country and subnat divisions in which it is located.

Character 60
ABBREV_LOC_NAME (LABBR) Preferred abbreviated name of the location Character 8
NEAREST_NAMED_PLACE (NNPID) Name of the nearest named place. Typically be used for a location where LNAME is a text description of the locality, and NNPID points to the village closest to that locality. For the purposes of data curation, note the conceptual difference: SNL*ID describe the hierarchy of administrative units within which the location occurs: whereas NNPID refers to the nearest named place. NNPID is thus not necessarily within SNL*ID. Long 4
SUBNAT _LEVEL3 (SNL3ID) Name of third level of political subdivision within country (e.g. Vereda). Long 4
SUBNAT _LEVEL2 (SNL2ID) Name of second level of political subdivision within country (e.g. District or County). Long 4
SUBNAT _LEVEL1 (SNL1ID) Name of first level of political subdivision within country (e.g. State, Province Department) Long 4
COUNTRY_CODE (CNTRYID) Country code linked to COUNTRY lookup table Long 4
LOCATION_REPLACE (LRPLCE) Own LOCID for deletion, replacement LOCID for replaced records, zero otherwise Long 4

GEOREFERENCE (GEOREF) (change in central)

Changes here are required to document the source and accuracy/precision/certainty of georef data.

Columns - Long Name (Name) Description Type Length (bytes)
LOCATION_ID (LOCID) Identifies the location. Link to the LOCATION table. Long 4
LATLON_POINT_NUMBER (LLPN) The lat-lon point (1 to NLLP). Integer 2
LATITUDE (LAT) Latitude in decimal degrees (+ for North, - for South, -1E36 for missing). Real 8
LONGITUDE (LON) Longitude in decimal degrees (+ for East, - for West, -1E36 for missing). Real 8
ALTITUDE (ALT) Altitude (m above sea level, -1E36 for missing) Real 4
LatLong_Source (LLSource) The type of source used for lat-long coordinates (e.g. GPS, digital gazetteer, map, collecting form, collecting report, provider's database with unspecified source: lookup UDFLDS) Integer 2
LatLong_Format (LL_FMT) Format in which Lat-Long coordinates were originally provided (lookup UDFLDS) Integer 2
LatLong_Datum (LL_Datum) Datum used for coordinate data (Lookup UDFLDS). (Assume WGS84 if unknown) Integer 2
LatLong_Uncertainty (LL_Uncert) Quantitative estimate of the uncertainty of the measure of coordinates Real 8
LatLong_Reference (LLRef) Reference to specific map, gazetteer, etc. Links to reference in BibRefs table Integer 4

INSTITUT (change in central)

Three types of change:

  • New field LOCID, referencing the institute record with its corresponding record in LOCATION.
  • Delete address fields: they will go into a new table ADDRESS
  • Merge IRGCIS and IRIS INSTYPE:
    • IRGC -> IRIS
    • CG -> INTER
    • GB -> GBANK
    • ID -> INDIV
    • NG -> NGO
    • NP -> NATION
    • PR -> PRCOMP
    • RO -> REGION
    • UI -> UNIVER

Note: INSTITUT is used to record details of business organizations that have some sort of business relationship with IRRI, e.g. for the purpose of exchange seed. In this sense, even a private individual could be considered as a “business organization”. By generalising the concept of INSTITUT to include any business organization, we can simplify the management of private and business addresses, by linking persons to addresses only through institutes for both business and private addresses. Note that if we need to know the private and business address of a person, we’d need to have two different records for the person, pointing to different INSTITIDs – just in the same way as for a person working for two organizations.

Field name Description Type Length
INSTITID INSTITUTION_ID Unique identifier for each institution Integer 2
PINSID INST_PARENT_ID. INSTID for the “parent” institution. Equal to INSTID if on top of hierarchical structure. In most cases, it is expected that PINSID will not be used, and INSTITID points to the legal entity that enters into legal agreements with IRRI. The exception is envisaged to be for INSTYPE=634=Department, in which PINSID represents the legal entity. Integer 2
LOCID Points to corresponding Location record (Non-null - institute must have a record in location; a location doesn't have to be of an institute) Long 4
INSNAME INSTITUTION_NAME. Full name of the institution. Char 150
INSACR INST_ACRONYM. Acronym of the institution (e.g., CIMMYT, IRRI, INIFAP) Char 20
INSTYPE INSTITUTION_TYPE. To specify the type of institution:, International center, National agricultural research service, University, etc. Link to the UDFLDS table. Integer 2
WEBURL WEB_SITE_URL. Char 60
SINS START_INST_DATE. Date the institution was founded. Format: YYYMMDD. Integer 4
EINS END_INST_DATE. Date the institution was closed or re-named. YYYMMDD. Integer 4
ICHANGE INSTITUTION_CHANGE. If the institution is changed to another institution. Integer 2

New tables

ADDRESS Table (add to central)

New table to hold addresses of institutes / people

Usage: Provide for multiple addresses per institute/person/location

Field name Description Type Length
AddressID Address_ID Unique identifier for the address. Primary key Integer 2
LOCID The ID of the location (institute, individual, department etc) to which the address applies. May be multiple AddressIDs sharing the same LOCID, representing different types of address Long 4
AddrType AddressType. Type of address, such as shipping address, port of entry, normal mailing address, Courier address, P.O Box, Street address, Personal address. Lookup in UDFLDS Integer 2
Addr1 1st line of address. Char 125
Addr2 2nd line of address Char 125
CITY City. Char 30
STATEID STATE_OR_PROVINCE. State or province. Link to the SNLEVEL table. Integer 2
CPOSTAL POSTAL_CODE. Postal code (e.g., US zip code). Char 10
CNTRYID ICIS_COUNTRY_CODE. Link to the COUNTRIES table. Integer 2
APHONE PHONE_NUMBER. Char 25
AFAX FAX_NUMBER. Char 25
AEMAIL EMAIL_ADDRESS. Char 40
SINS START_INST_DATE. Date the institution was founded. Format: YYYMMDD. Integer 4
EINS END_INST_DATE. Date the institution was closed or re-named. YYYMMDD. Integer 4

COLLECTING_MISSION (COLLMISS) (add to local)

New table representing a collecting mission, undertaken in a target country by a team of collectors.

Comment: could be represented by a list with attributes. Should we do that instead?

Field name Description Type Length
MissID A unique ID for the collecting mission. Primary key. Number 2
MissDesc Description/title of the collecting mission Number 2
MissCounID Country ID in which the collecting mission was conducted Number 2
MissStart Start date of the collecting mission. YYYYMMDD Number 4
MissEnd End date of the collecting mission. YYYYMMDD Number 4
MissPDF Location of PDF file of mission report Text 50
MissRef ID of reference to the mission. Links to RefID in BibRefs Integer 2


Team_Members (TeamMemb) (add to local)

New table to the people and institutes involved in a collecting mission.


Field name Description Type Length
TMemberID A unique ID for team member record. Primary key Number 2
MissID ID for the collecting mission. Links to MissID in CollMiss Number 2
PersonID ID of the collector. Links to PersonID in Persons Number 2
InstitID ID of the collector’s institute. Links to InstitID in INSTITUT. Number 2


GID_Represents (GIDREP) (add to local)

New table to show what each GID represents. Should contain exactly 1 record for each record in GERMPLSM.

Comment: could be added as fields to GERMPLSM in central. Might not be necessary if data validation rules are followed very well.

Field name Description Type Length
GID The GID. Primary key. Number 4
Represents What type of germplasm is represented by the GID (and hence how to interpret GLOCN and GDATE). Lookup values in UDFLDS. 0 for unvalidated status of a GID. Number 2
VReason Text justification of why the GID is considered to represent the value in REPRESENTS Text 255
VTimeStamp Date and time of validation DateTime
VRef Reference document used for validation. Links to RefID in BibRefs Number 4
VUserID UserID of the person responsible for validating Number 2

Valid_Codes (VALIDCODE) (add to local)

New table to identify valid germplasm naming systems.

Use case: germplasm managers typically assign names to be used as IDs for their germplasm samples, often in the form of an alphabetic prefix followed by a sequential number. In some cases, there may be a suffix after the prefix-number combination; and hyphens may or may not be used in the suffix. In such cases the managing institution can usually be deduced from the code. For example, “PI 123456” is a name assigned by USDA to an accession in the National Plant Genetic resources system of the USA.

Usage: if a name begins with a code, this table should be used to validate NTYPE and NLOCN. Codes may be duplicated if they have been used in more than one system.


Field name Description Type Length
CodeID A unique ID internally assigned to identify the record. Primary key. Number 2
NamePref A code used at the beginning of a name. Text 10
Suffix Boolean indicating whether a non-numeric suffix is used after Code-Number Boolean 2
Hyphen Boolean indicating whether hyphens are used in the suffix. Boolean 2
CLocID LOCID of the organization assigning names in this format Number 4
CType Type of germplasm identified by this name. Look up UDFLDS for FTYPE REPRESENTS (FTABLE GIDIS) Number 2

New records

UDFLDS

See uploaded file media:UDFLDS_GRIMS_Changes_2009.xls

Personal tools