aerocom:data_submission

Differences

This shows you the differences between two versions of the page.

Link to this comparison view

Both sides previous revision Previous revision
Next revision
Previous revision
aerocom:data_submission [2015-08-10 11:47:20]
michaels
aerocom:data_submission [2022-05-31 09:29:31] (current)
Line 1: Line 1:
 +ATTENTION - THIS WIKI PAGE IS NO LONGER UPDATED - PLEASE GO TO [[http://aerocom.met.no/|aerocom.met.no]]FOR LATEST INFO
 +
 +
 AeroCom  data submission requirements AeroCom  data submission requirements
  
-==== SUBMISSION OF DATA PROCEDURE ==== +====== SUBMISSION OF DATA PROCEDURE ====== 
-Any submission of data to the AeroCom database requires to follow the following steps: 1) [[#FORMAT DATA|format data]] to agreed netCDF htap standards, cf compliant  2) [[#ADJUST FILENAMES |adjust filenames]]  3) [[#CHECK FORMAT|check format]] with cf compliance checker, correct and check again  4) [[#OBTAIN ACCOUNT|obtain account]] on htap server to access ftp and/or aerocom-users server 5) [[#TRANSFER DATA|transfer data]] to the ftp server at the Norwegian Meteorological Institute 6) [[#NOTIFY MET.NO|notify metno]] that data have been submitted  7) [[#LOG FILE SUBMISSION|log submission]] in [[aerocom:modelupdates|log file]] on htap wiki page 8) [[#UPDATES|update errors]] on submission log on htap wiki page +Any submission of data to the AeroCom database requires to follow the following steps:  
 +  - [[#FORMAT DATA|format data]] to agreed netCDF htap standards, cf compliant   
 +  - [[#ADJUST FILENAMES |adjust filenames]] 
 +  [[#CHECK FORMAT|check format]] with cf compliance checker, correct and check again 
 +  [[#OBTAIN ACCOUNT|obtain account]] on htap server to access aerocom-users server  
 +  - [[#TRANSFER DATA|transfer data]] to the aerocom-users.met.no at the Norwegian Meteorological Institute  
 +  - [[#NOTIFY MET.NO|notify metno]] that data have been submitted   
 +  - [[#LOG FILE SUBMISSION|log submission]] in [[aerocom:modelupdates|log file]] on htap wiki page  
 +  - [[#UPDATES|update errors]] on submission log on htap wiki page 
  
  
-=== 1) FORMAT DATA === +===== 1) format data ===== 
-Each variable should be stored in ONE cf-compliant netCDF file, which includes all time steps for a given year.  The list of variables to be output is provided in a spreadsheet per experiment and a description is provided in the experiment specifications document on the same wiki page.+Each variable should be stored in ONE cf-compliant NetCDF file, which includes all time steps for a given year.  The list of variables to be output is provided in a spreadsheet per experiment and a description is provided in the experiment specifications document on the same wiki page.
  
-CF-compliance implies that sufficient attributes, coordinates and coordinate describing variables are included in each file to describe metadata needed to understand grid representation, units and origin of the data. See excel htap protocol file for variable names, standard names (comming end of September 2013).+CF-compliance implies that sufficient attributes, coordinates and coordinate describing variables are included in each file to describe metadata needed to understand grid representation, units and origin of the data. 
  
 See also frequently asked questions on CF: [[https://github.com/graybealski/cf-conventions-work/blob/master/FAQ.md|URL-link]] See also frequently asked questions on CF: [[https://github.com/graybealski/cf-conventions-work/blob/master/FAQ.md|URL-link]]
  
-Formatting can be achieved with cmor tools (http://www-pcmdi.llnl.gov/software-portal/cmor/) and nco tools ([[http://nco.sourceforge.net/nco.html|URL-link]]). For using cmor tools it is required to obtain the htap-2 cmor tables: [[http://iek8wikis.iek.fz-juelich.de/HTAPWiki/WP2.2#A3._HTAP2_model_output_files|URL-link to htap wiki with link to CMOR tables]]). 
  
-An example file as used for the previous AeroCom experiment is found herehttp://aerocom.met.no/download/CMOR/FILE-EXAMPLE/ (htap example files will follow soon )+===== 2) adjust filenames ===== 
 +The obligatory filename for using the cf checker and the submission of data is:\\ 
 +**aerocom3_<ModelName>-<MeteoConfigSpecifier>_<ExperimentName>-<PerturbationName>_<VariableName>_<VerticalCoordinateType>_<Period>_<Frequency>.nc**
  
 +Details about new file naming can be found here:
 +[[https://docs.google.com/spreadsheets/d/1NiHLVTDsBo0JEBSnnDECNI2ojUnCVlxuy2PFrsRJW38/edit?usp=sharing | AeroCom experiments diagnostics sheets 28.2.2019]]
  
 +Be aware of updates ! \\
  
-=== 2) ADJUST FILENAMES === +versions will have a date attached.\\
-The obligatory Filename for using the cf checker and the submission of data is+
  
-"aerocom3_<ModelName>_<ExperimentName>_<VariableName>_<VerticalCoordinateType>_<Period>_<Frequency>.nc"+Examples: 
 +  * aerocom3_GOCARTv7.2_AP3-CTRL_od550aer_Column_2010_monthly.nc 
 +  * aerocom3_GISS-MATRIX-met2010_AP3-ACRI-AFRICA_od550aer_Column_2010_daily.nc 
 +  * aerocom3_GOCARTv7.2-met2010_AP3-CTRL-od550aer_Column_1850_monthly.nc 
 +  * aerocom3_GISS-MATRIX-fSST_AP3-CTRL-od550aer_Column_1850_monthly.nc
  
-OLD aerocom naming convention is also fine for now: +OLD (aerocom phase3)  
 +**aerocom3_<ModelName>_<ExperimentName>_<VariableName>_<VerticalCoordinateType>_<Period>_<Frequency>.nc**
  
-"aerocom_<ModelName>_<ExperimentName>_<VariableName>_<Period>_<Frequency>.nc"+OLD (aerocom phase 2) aerocom naming convention is also fine for now:  
 +**aerocom_<ModelName>_<ExperimentName>_<VariableName>_<Period>_<Frequency>.nc**
  
 +  * <ModelName> => can be chosen such that Model Name, Model version and possibly the institution can be identified.  No underscores (_) are allowed in <!ModelName>, use (-) instead. Restrict <ModelName> to max 20 characters.
 +  * <MeteoConfigSpecifiers> => "met2010" (default, nudged to 2010 meteo) / "fSST" (fixed 2010 SST monthly fields, not nudged) / "CPL" (coupled AOGCM model, not nudged) A MeteoSpecifier has to be present
 +  * <ExperimentName> => see  Experiment Specifications 
 +  * <PerturbationName> see experiment descriptions on aerocom wiki, if required for the experiment, use exact case sensitive name from description
 +  * <VariableName> => see output variable names in excel sheet
 +  * <VerticalCoordinateType> => "Surface", "Column", "ModelLevel", "SurfaceAtStations", "ModelLevelAtStations"
 +  * <Period> => eg "2008", "2010"
 +  * <Frequency> => "timeinvariant","hourly", "3hourly", "daily", "monthly", "sat1000", "sat1330", "sat2200", "sat0130"
  
-<ModelName> => can be chosen such that Model Name, Model version and possibly the institution can be identified.  No underscores (_) are allowed in <!ModelName>, use (-) instead. Restrict <ModelName> to max 20 characters. 
  
-<ExperimentName> => see  Experiment Specifications +===== 3) check format ===== 
 +Typical files must be checked prior to submission via a cf-compliance-checker. [[http://cfconventions.org | cfconventions.org]]  maintains a list of working and frequently updated [[http://cfconventions.org/compliance-checker.html|CF checkers]]. Some come as a script, and to others one can upload a file. 
  
-<VariableName> => see output variable names in excel sheet+For the upload checker you want to check a typical but small file. To achieve this, you can cut out the first time steps with the nco command: 
 +ncks -O -d time,0,10 sourcefile.nc filetest.nc
  
-<VerticalCoordinateType> => "Surface", "Column", "ModelLevel", "SurfaceAtStations""ModelLevelAtStations"+Per default, a check shall comprise at least one file of each VerticalCoordinateType and each Frequency and each experimentbest varying the variables!
  
-<Period> => eg "2008", "2010" 
  
-<Frequency> => "timeinvariant","hourly", "daily", "monthly", "sat1000", "sat1330", "sat2200", "sat0130" +===== 4obtain account =====
- +
- +
-=== 3) CHECK FORMAT === +
-Typical files must be checked prior to submission via the cf-compliance-checker.  URL http://aerocom-test.met.no/upload (will be changed to http://cfchecker.met.no shortly) +
- +
-If you want to check a typical file and rather want to use a smaller file, you can cut out the first time steps with an nco command: +
-ncks -O -d time,0,10 file.nc filetest.nc +
- +
-Read also there the help section. Note: Errors indicate section in the original +
-[[http://cf-pcmdi.llnl.gov/documents/cf-conventions/1.6/cf-conventions.html|CF convention document]]. +
- +
-Typical files? A check shall comprise at least one file of each VerticalCoordinateType and each Frequency and each experiment, best varying the variables! +
- +
- +
-=== 4) OBTAIN ACCOUNT ===+
  
 ...for METNO aerocom servers and WIKI documentation server  ...for METNO aerocom servers and WIKI documentation server 
  
-An account at Met.No is needed to submit data via ftp and edit [[aerocom:|AeroCom wiki pages]]. A data server (aerocom-users.met.no) allows users also to access the data via ssh. Please contact jan.griesfeller@met.no or Anna (annac@met.nofor password and user name. You will then be notified further about the procedure. A public ssh-key has to be sent to Jan Griesfeller to activate the account. +An account at Met.No is needed to submit data a data server (aerocom-users.met.no) via ssh and edit [[aerocom:|AeroCom wiki pages]]. Please contact jan.griesfeller@met.no and/or anna.benedictow@met.no for password and user name. You will then be notified further about the procedure. A public ssh-key has to be sent to jan and/or anna to activate the account.
- +
- +
-=== 5) TRANSFER DATA === +
-After obtaining an account, data can then be uploaded in two ways: +
- +
-Preferred path: via aerocom-users.met.no: make directory under /media/scratch/incoming/AEROCOM_AUTO_UPLOAD/<ModelName>/renamed/ +
-Use exactly the same model name as used for file names. Attention to lower&upper case!!  +
- +
-Put files in that renamed directory. Overall size limit of ca 500 GB ( limit for all users)    +
- +
-via ftp.aerocom.met.no: make your directory under /incoming/ and put files there. Overall size limit of ca 500 GB ( limit for all users)  +
  
-Large submissions (> 10 GB) may require extra attention (use rsync, scftp, not simple ftp clients). They might require prior coordination with Met.No/Anna Benedictow/Jan Griesfeller/Michael Schulz via email, in particular in times of simultaneous heavy upload activity.  
  
-The directory /media/scratch/incoming/AEROCOM-P3-AUTO-UPLOAD allows for automatic incorporation into the AeroCom database and workupUploaded files are processed automatically by the AeroCom tools and transfered into the AeroCom phase III data directory.+===== 5) transfer data ===== 
 +After obtaining an account, data can then be uploaded to the aerocom user server aerocom-users.met.no: 
 +  * Create a directory under /media/scratch/incoming/AEROCOM_AUTO_UPLOAD/<ModelName>/renamed/\\ Use exactly the same model name as used for file namesAttention to lower&upper case!!\\ Put files in that renamed directory. Overall size limit of ca 6 TB (limit for all users)    
 +  * upload via ftp.aerocom.met.no has been discontinued due to security concerns. 
 +  
  
-If correct in format and with correct filenames, results uploaded here will be processed over night and appear after a day as image catalogue on http://aerocom.met.no/cgi-bin/aerocom/surfobs_annualrs.pl?PROJECT=AEROCOM&MODELLIST=AEROCOM-Phase-III+Large submissions (> 100 GB) may require extra attention (use rsync). They might require prior coordination with Met.No/Anna Benedictow/Jan Griesfeller/Michael Schulz via email, in particular in times of simultaneous heavy upload activity
  
 +On aerocom-users.met.no the directory /media/scratch/incoming/AEROCOM-P3-AUTO-UPLOAD allows for automatic incorporation into the AeroCom database and workup. Uploaded files are processed automatically by the AeroCom tools and transfered into the AeroCom phase III data directory. (Still in test phase - send also e-mail (ref. [[#NOTIFY MET.NO|notify metno]]))
  
-=== 6) NOTIFY MET.NO === +If correct in format and with correct file namesresults uploaded here will be processed over night and appear after a day as image catalogue on http://aerocom.met.no/cgi-bin/aerocom/surfobs_annualrs.pl?PROJECT=AEROCOM&MODELLIST=AEROCOM-Phase-III
-When files are uploaded send email to jan.griesfeller@met.no AND michael.schulz@met.no and annac@met.noso that the files can be transferred from the ftp disk to the database. When the data are included in the database the files appear on aerocom-users.met.no under /metno/aerocom/users/aerocom1/<!ModelName>.+
  
-After the data are transferred to the database, data are removed from ftp server or aerocom-users:scratch server. 
  
 +===== 6) notify met.no =====
 +When files are uploaded send email to anna.benedictow@met.no and jan.griesfeller@met.no and michael.schulz@met.no (AND coordinator of project), so that the files can be transferred from aerocom-users.met.no to the database. When the data are included in the database the files appear on aerocom-users.met.no under /metno/aerocom-users-database/PROJECT(AEROCOM-PHASE-III)/<ModelName>_<exp>/.
  
 +After the data are transferred to the database, data are removed from aerocom-users:scratch server.
  
  
-=== 7) LOG FILE SUBMISSION === +===== 7) submit a log file ===== 
-FOR EACH SUBMISSION: Put an entry into the htap wiki page [[modelupdates|Log file]], describing what has been submitted. This is also obligatory if changes or updates (even partial) to the original submission are made. We also request modelers to record on the wiki information on known bugs, or any other relevant information needed for analysis of the model results by others.+FOR EACH SUBMISSION: Put an entry into the htap wiki page [[modelupdates|Log file]], describing what has been submitted. This is also obligatory if changes or updates (even partial) to the original submission are made. We also request modellers to record on the wiki information on known bugs, or any other relevant information needed for analysis of the model results by others.
  
  
-=== 7UPDATES ===+===== 8updates =====
  
 In case of updates of files, the model version identifier should also contain information on the resubmission of model results. Additional files which do not change an earlier submission may contain the same version number as an older submission. Corrections and a model rerun should be accompanied by a new model version identifier. Old still valid files may be merged upon request with new data. Old versions of model submissions will be temporally moved to a separate directory by Met.No, and where possible, deleted after some time. An exception will be made for versions that have been used in publications. Modellers are responsible for communication of major updates to their models (e-mail and log-files, see below). Model analysers are responsible for frequently verifying changes in model submissions (e.g. before submitting a paper). In case of updates of files, the model version identifier should also contain information on the resubmission of model results. Additional files which do not change an earlier submission may contain the same version number as an older submission. Corrections and a model rerun should be accompanied by a new model version identifier. Old still valid files may be merged upon request with new data. Old versions of model submissions will be temporally moved to a separate directory by Met.No, and where possible, deleted after some time. An exception will be made for versions that have been used in publications. Modellers are responsible for communication of major updates to their models (e-mail and log-files, see below). Model analysers are responsible for frequently verifying changes in model submissions (e.g. before submitting a paper).
  
                
  • aerocom/data_submission.1439207240.txt.gz
  • Last modified: 2022-05-31 09:23:10
  • (external edit)