AeroCom data submission requirements
SUBMISSION OF DATA PROCEDURE
Any submission of data to the AeroCom database must follow these steps:
1) format data to the agreed netCDF htap standards, CF-compliant
2) adjust filenames
3) check format with the cf compliance checker, correct and check again
4) obtain an account on the htap server to access ftp and/or the aerocom-users server
5) transfer data to the ftp server at the Norwegian Meteorological Institute
6) notify metno that data have been submitted
7) log the submission in the log file on the htap wiki page
8) update errors in the submission log on the htap wiki page
1) FORMAT DATA
Each variable should be stored in ONE cf-compliant netCDF file, which includes all time steps for a given year. The list of variables to be output is provided in a spreadsheet per experiment and a description is provided in the experiment specifications document on the same wiki page.
CF-compliance implies that each file includes sufficient attributes, coordinates and coordinate-describing variables to provide the metadata needed to understand the grid representation, units and origin of the data. See the excel htap protocol file for variable names and standard names (coming end of September 2013).
See also frequently asked questions on CF: URL-link
Formatting can be achieved with cmor tools (http://www-pcmdi.llnl.gov/software-portal/cmor/) and nco tools (http://nco.sourceforge.net/nco.html). Using the cmor tools requires the htap-2 cmor tables (to be provided shortly).
An example file as used for the previous AeroCom experiment is found here: http://aerocom.met.no/download/CMOR/FILE-EXAMPLE/ (htap example files will follow soon)
2) ADJUST FILENAMES
The obligatory filename format for using the cf checker and for submitting data is
“aerocom3_<ModelName>_<ExperimentName>_<VariableName>_<VerticalCoordinateType>_<Period>_<Frequency>.nc”
OLD aerocom naming convention is also fine for now:
“aerocom_<ModelName>_<ExperimentName>_<VariableName>_<Period>_<Frequency>.nc”
<ModelName> ⇒ choose it so that the model name, the model version and possibly the institution can be identified. No underscores (_) are allowed in <ModelName>, use (-) instead. Restrict <ModelName> to max 20 characters.
<ExperimentName> ⇒ see Experiment Specifications
<VariableName> ⇒ see output variable names in excel sheet
<VerticalCoordinateType> ⇒ “Surface”, “Column”, “ModelLevel”, “SurfaceAtStations”, “ModelLevelAtStations”
<Period> ⇒ e.g. “2008”, “2010”
<Frequency> ⇒ “timeinvariant”, “hourly”, “daily”, “monthly”, “sat1000”, “sat1330”, “sat2200”, “sat0130”
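The naming rules above can be sketched as a small Python helper that builds a file name and rejects values that break the stated rules. This is only an illustration: the function names and the example values ("MyModel-v1", "EXP1", "varA") are hypothetical, and the parser assumes that no field contains an underscore, which the text only states for <ModelName>.

```python
# Allowed values copied from the naming convention above.
VERTICAL_TYPES = {"Surface", "Column", "ModelLevel",
                  "SurfaceAtStations", "ModelLevelAtStations"}
FREQUENCIES = {"timeinvariant", "hourly", "daily", "monthly",
               "sat1000", "sat1330", "sat2200", "sat0130"}
FIELDS = ("model", "experiment", "variable", "vertical", "period", "frequency")


def build_filename(model, experiment, variable, vertical, period, frequency):
    """Return an aerocom3 file name, raising ValueError if a rule is broken."""
    if not model or "_" in model:
        raise ValueError("ModelName must be non-empty and use '-' instead of '_'")
    if len(model) > 20:
        raise ValueError("ModelName is limited to 20 characters")
    if vertical not in VERTICAL_TYPES:
        raise ValueError("unknown VerticalCoordinateType: %r" % vertical)
    if frequency not in FREQUENCIES:
        raise ValueError("unknown Frequency: %r" % frequency)
    parts = ("aerocom3", model, experiment, variable, vertical,
             str(period), frequency)
    return "_".join(parts) + ".nc"


def parse_filename(name):
    """Split an aerocom3 file name into its six fields.

    Assumption: no field contains an underscore (the text only states
    this for ModelName).
    """
    if not (name.startswith("aerocom3_") and name.endswith(".nc")):
        raise ValueError("not an aerocom3 file name: %r" % name)
    values = name[len("aerocom3_"):-len(".nc")].split("_")
    if len(values) != len(FIELDS):
        raise ValueError("expected %d fields, got %d" % (len(FIELDS), len(values)))
    return dict(zip(FIELDS, values))


# Hypothetical example values, not taken from the AeroCom specifications.
example = build_filename("MyModel-v1", "EXP1", "varA", "Column", 2008, "monthly")
print(example)
```

Running such a check locally before renaming files may catch underscore or length problems in <ModelName> before the files reach the cf checker.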
3) CHECK FORMAT
Typical files must be checked prior to submission via the cf-compliance-checker. URL http://aerocom-test.met.no/upload (will be changed to http://cfchecker.met.no shortly)
To check a smaller version of a typical file, you can cut out the first time steps (here time indices 0 to 10) with an nco command: ncks -O -d time,0,10 file.nc filetest.nc
Also read the help section on the checker page. Note: error messages indicate the relevant section of the original CF convention document.
What counts as typical files? A check should comprise at least one file for each VerticalCoordinateType, each Frequency and each experiment, preferably covering different variables.
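One possible way to pick such a set of typical files is sketched below in Python: it groups aerocom3 file names by experiment, VerticalCoordinateType and Frequency, then takes one file per group, preferring a variable that has not been picked yet. The function name and the example file names are hypothetical, and the sketch assumes that no field in the file name contains an underscore.

```python
def pick_check_files(filenames):
    """Pick one file per (experiment, vertical type, frequency) combination.

    Returns a dict mapping each combination to the chosen file name.
    Names that do not have the seven aerocom3 fields are ignored.
    """
    groups = {}
    for name in sorted(filenames):
        stem = name[:-len(".nc")] if name.endswith(".nc") else name
        parts = stem.split("_")
        if len(parts) != 7 or parts[0] != "aerocom3":
            continue
        _, _model, experiment, variable, vertical, _period, frequency = parts
        key = (experiment, vertical, frequency)
        groups.setdefault(key, []).append((variable, name))

    chosen = {}
    used_variables = set()
    for key in sorted(groups):
        candidates = groups[key]
        # Prefer a variable not used for another combination, so that the
        # checked files vary the variables where possible.
        pick = next((c for c in candidates if c[0] not in used_variables),
                    candidates[0])
        used_variables.add(pick[0])
        chosen[key] = pick[1]
    return chosen


# Hypothetical file names for illustration only.
files = [
    "aerocom3_M_EXP1_varA_Surface_2008_monthly.nc",
    "aerocom3_M_EXP1_varB_Surface_2008_monthly.nc",
    "aerocom3_M_EXP1_varA_Column_2008_monthly.nc",
    "aerocom3_M_EXP1_varB_Column_2008_monthly.nc",
    "aerocom3_M_EXP1_varA_Surface_2008_daily.nc",
]
selection = pick_check_files(files)
for key in sorted(selection):
    print(key, selection[key])
```

Each selected file can then be shortened with the ncks command above before it is uploaded to the checker.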
4) OBTAIN ACCOUNT
…for METNO aerocom servers and WIKI documentation server
An account at Met.No is needed to submit data via ftp and to edit AeroCom wiki pages. A data server (aerocom-users.met.no) also allows users to access the data via ssh. Please contact jan.griesfeller@met.no or Anna (annac@met.no) for a user name and password. You will then be notified further about the procedure. A public ssh-key has to be sent to Jan Griesfeller to activate the account.
5) TRANSFER DATA
After obtaining an account, data can then be uploaded in two ways:
Preferred path, via aerocom-users.met.no: create the directory /media/scratch/incoming/AEROCOM_AUTO_UPLOAD/<ModelName>/renamed/. Use exactly the same model name as in the file names, and pay attention to upper and lower case.
Put the files in that renamed directory. Overall size limit of ca. 500 GB (limit for all users).
Alternative, via ftp.aerocom.met.no: make your directory under /incoming/ and put the files there. Overall size limit of ca. 500 GB (limit for all users).
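Because the upload directory must carry exactly the same model name as the files, including case, a quick local check before uploading can help. The Python sketch below builds the target path and lists files whose <ModelName> field differs from the directory name. The function names and example values are hypothetical, and the root path is simply the one quoted for the preferred path above; please confirm the current path with Met.No before relying on it.

```python
import posixpath

# Root directory as quoted in the text above (assumption: still current).
UPLOAD_ROOT = "/media/scratch/incoming/AEROCOM_AUTO_UPLOAD"


def upload_dir(model):
    """Return the 'renamed' upload directory for a given model name."""
    return posixpath.join(UPLOAD_ROOT, model, "renamed")


def mismatched_files(model, filenames):
    """Return files whose ModelName field differs from `model`.

    The comparison is case-sensitive. ModelName is the second
    underscore-separated field in both the aerocom3 and the old
    aerocom naming convention.
    """
    bad = []
    for name in filenames:
        parts = name.split("_")
        if len(parts) < 2 or parts[1] != model:
            bad.append(name)
    return bad


# Hypothetical model and file names for illustration only.
model_name = "MyModel-v1"
candidates = [
    "aerocom3_MyModel-v1_EXP1_varA_Surface_2008_monthly.nc",
    "aerocom3_mymodel-v1_EXP1_varA_Surface_2008_monthly.nc",  # wrong case
]
print(upload_dir(model_name))
print(mismatched_files(model_name, candidates))
```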
Large submissions (> 10 GB) may require extra attention (use rsync, scftp, not simple ftp clients). They might require prior coordination with Met.No/Anna Benedictow/Jan Griesfeller/Michael Schulz via email, in particular in times of simultaneous heavy upload activity.
The directory /media/scratch/incoming/AEROCOM-P3-AUTO-UPLOAD allows for automatic incorporation into the AeroCom database and workup. Uploaded files are processed automatically by the AeroCom tools and transferred into the AeroCom phase III data directory.
If the format and the filenames are correct, results uploaded here will be processed overnight and appear after a day as an image catalogue on http://aerocom.met.no/cgi-bin/aerocom/surfobs_annualrs.pl?PROJECT=AEROCOM&MODELLIST=AEROCOM-Phase-III
6) NOTIFY MET.NO
When the files are uploaded, send an email to jan.griesfeller@met.no AND michael.schulz@met.no and annac@met.no, so that the files can be transferred from the ftp disk to the database. When the data are included in the database, the files appear on aerocom-users.met.no under /metno/aerocom/users/aerocom1/<ModelName>.
After the data are transferred to the database, they are removed from the ftp server or the aerocom-users:scratch server.
7) LOG FILE SUBMISSION
FOR EACH SUBMISSION: Put an entry into the htap wiki page Log file, describing what has been submitted. This is also obligatory if changes or updates (even partial) to the original submission are made. We also request modellers to record on the wiki any known bugs and any other information relevant to the analysis of the model results by others.
8) UPDATES
When files are updated, the model version identifier should also contain information on the resubmission of model results. Additional files which do not change an earlier submission may carry the same version number as an older submission. Corrections and model reruns should be accompanied by a new model version identifier. Old files that are still valid may be merged with new data upon request. Old versions of model submissions will be temporarily moved to a separate directory by Met.No and, where possible, deleted after some time. An exception will be made for versions that have been used in publications. Modellers are responsible for communicating major updates to their models (e-mail and log-files, see below). Model analysers are responsible for frequently verifying changes in model submissions (e.g. before submitting a paper).