Exporting the Structure

Introduction

Critic2 can be used as a converter between different crystal structure file formats. For instance, to convert a cif file to a QE input, we can use:

CRYSTAL myfile.cif
WRITE myfile.scf.in

Sometimes, it is also necessary to create a finite representation of a crystal by taking the crystal motif, perhaps extended with some of atoms in the neighboring unit cells. The simplest way of doing this is by writing an xyz file:

CRYSTAL myfile_DEN
WRITE myfile.xyz

The MOLMOTIF keyword writes all atoms in the unit cell and completes the molecules by using atoms in the neighboring cells. One of the important aims of WRITE is to write template input files for different programs. The particular keywords used in those templates (calculation level, basis set, etc.) are mostly meaningless but the structure is correct. It is up to the user to adapt these templates to suit their needs.

In critic2, the main keyword to export the molecular or crystal structure, or a transformation of those structures, is the WRITE keyword.

Writing the Structure (WRITE)

WRITE file.{xyz,gjf,cml} [ix.i iy.i iz.i] [BORDER]
      [SPHERE rad.r [x0.r y0.r z0.r]] [CUBE side.r [x0.r y0.r z0.r]]
      [MOLMOTIF] [ONEMOTIF] [ENVIRON dist.r] [NMER nmer.i]
WRITE file.{obj,ply,off} [ix.i iy.i iz.i] [BORDER]
      [SPHERE rad.r [x0.r y0.r z0.r]] [CUBE side.r [x0.r y0.r z0.r]]
      [MOLMOTIF] [ONEMOTIF] [CELL] [MOLCELL]
WRITE file.scf.in [rklength.r]
WRITE file.pwi [rklength.r]
WRITE file.tess
WRITE file.cri|file.incritic
WRITE {[file.]POSCAR|[file.]CONTCAR|file.vasp}
WRITE file.abin
WRITE file.elk
WRITE file.gau
WRITE file.cif [NOSYM|NOSYMM]
WRITE file.d12 [NOSYM|NOSYMM] [NOEXTERNAL]
WRITE file.m
WRITE file.db
WRITE file.gin
WRITE file.lammps
WRITE file.cell [rklength.r]
WRITE file.fdf
WRITE file.STRUCT_IN
WRITE file.hsd
WRITE file.gen
WRITE file.pyscf
WRITE file.in [rklength.r] ## FHIaims geometry.in
WRITE file.frac
WRITE file.pdb

The WRITE keyword writes the currently loaded structure to a file. A number of file formats can be written by critic2. As in CRYSTAL and MOLECULE, the type of file is detected by the extension (.xyz, .in, .cri, etc.).

Molecular File Formats (xyz, gjf, cml)

With this command:

WRITE file.{xyz,gjf,cml} [ix.i iy.i iz.i] [BORDER]
      [SPHERE rad.r [x0.r y0.r z0.r]] [CUBE side.r [x0.r y0.r z0.r]]
      [MOLMOTIF] [ONEMOTIF] [ENVIRON dist.r] [NMER nmer.i]

WRITE generates an xyz file containing a finite piece of the crystal (if the structure was loaded with CRYSTAL) or the molecule (resp. MOLECULE). Alternatively, if the .gjf extension is used, a template for a Gaussian input file is written. If cml is used, a Chemical Markup Language file (xml-style) is created, containing the same molecular fragment (see below). The number of cells used in each direction is given by ix.i, iy.i, and iz.i (default: 1, 1, 1). For the purpose of its graphical representation, it is sometimes convenient to include atoms that are almost exactly at the edge of the cell. For instance, the NaCl crystal is:

CRYSTAL
  SPG f m -3 m
  CELL 5.64 5.64 5.64 90 90 90 ANG
  NEQ 0. 0. 0. na
  NEQ 1/2 1/2 1/2 cl
ENDCRYSTAL
WRITE nacl.xyz

Critic2 will (correctly) generate a list of 4 Na and 4 Cl atoms, representing 1/8th of the conventional cell, because the atoms at (1, 0, 0), (1, 1/2, 0), etc. are repetitions of the atoms in the main cell. However, this does not look good when the unit cell is represented because many of the atoms in the cubic cell are “missing”. The BORDER keyword instructs critic2 to include atoms that are (almost) exactly at the edge of the cell.

The SPHERE keyword writes all atoms inside a sphere of radius rad.r (bohr) and centered around the crystallographic coordinates (x0.r, y0.r, z0.r). In molecules, the default units for both the center and radius of the sphere are Cartesian in angstrom. If no center is given, (0,0,0) is used in both cases. The similar keyword CUBE writes all atoms inside a cube of side side.r centered around (x0.r, y0.r, z0.r) (default: (0,0,0)).

The keyword MOLMOTIF is used in molecular crystals. All atoms in the requested crystal fragment (indicated by the optional ix.i,… integers) are written to the xyz file. Then, the molecules in the fragment are completed by including atoms from outside the fragment. Critic2 detects whether the atomic connectivity using a distance criterion (see the BONDFACTOR and RADII keywords).

The ONEMOTIF and ENVIRON keywords are also used in molecular crystals. ONEMOTIF writes all atoms in the unit cell, translated by lattice vectors so that the resulting fragment has whole molecules only. ENVIRON writes the molecular environment of the unit cell origin up to a distance renv.r (bohr in crystals, angstrom in molecules). All molecules whose center of mass is at a distance less than renv.r are written in their entirety, even if some of their atoms exceed the renv.r distance from the origin.

The NMER keyword is used in molecular systems as well, and in combination with ONEMOTIF, MOLMOTIF, or ENVIRON. When NMER is given, the fragment of the system selected with either of those three keywords is split into its component molecules. Then, all monomers, dimers, trimers,… are written to separate files. All n-mers are written from monomers up to n-mers, where n is equal to nmer.i. In NMER is used with ENVIRON, the first molecule in all n-mers except for those with n = nmer.i is always part of the Wigner-Seitz cell, which is useful when generating molecular environments of the crystal for calculations using incremental methods.

There is an important application of the xyz-format WRITE keyword: the coordinates written to the xyz file are consistent with the transformation to Cartesian coordinates in critic2, so it is possible to bring back all or part of these coordinates to critic2 in order to represent a subset of the atoms in a crystal. This is very useful when generating fragments for an NCIPLOT calculation (see the FRAGMENT keyword) and in some LOAD options to obtain the promolecular density of a subset of the atoms. Whether the contents of an xyz file are recognized by critic2 as atoms belonging to the current system or not can be determined using the IDENTIFY keyword.

The CML (Chemical Markup Language) format has the same options as the xyz output format. In the CML format, an XML-style file is written containing the selected crystal fragment. If the system is a crystal (loaded with the CRYSTAL keyword), then the cell geometry is written to the CML file as well. The CML output format is specially tailored for being easy to read by avogadro and its underlying engine, openbabel.

Graphical File Formats (obj, ply, off)

The following keyword also writes finite molecular representations of the structure:

WRITE file.{obj,ply,off} [ix.i iy.i iz.i] [BORDER]
      [SPHERE rad.r [x0.r y0.r z0.r]] [CUBE side.r [x0.r y0.r z0.r]]
      [MOLMOTIF] [ONEMOTIF] [CELL] [MOLCELL]

In this case, however, the generated files are graphical representations.

The OBJ output is the Wavefront OBJ format. The OBJ format is a three-dimensional model representation, that is, it uses vertices and faces instead of atoms. This file format is understood by many visualizers such as view3dscene, meshlab, blender, and others. The keywords have the same meaning as in the xyz format. The additional CELL keyword instructs critic2 to write a stick representation of the unit cell. In a molecular structure, the MOLCELL keyword can be used to represent the molecular cell. The similarly popular PLY (polygon file format or Stanford triangle format) and OFF (Geomview) file formats can be used as well, with the same options.

Quantum ESPRESSO (scf.in, pwi)

Quantum ESPRESSO inputs can be written using the extensions .scf.in or .pwi. Critic2 uses ‘ibrav=0’ always, and writes a CELL_PARAMETERS block containing the crystallographic-to-Cartesian transformation matrix. QE is particular about how this matrix should written in order for its own symmetry module to work. If the crystal setting matches any of those covered in the QE manual, then that particular matrix is used. Otherwise, critic2 uses its own internal CELL_PARAMETERS matrix, which may result in Quantum ESPRESSO failing to recognize the crystal symmetry.

By default, the crystal cell used by critic2 is written to the QE input template. To reduce the cell to a primitive, use NEWCELL with the PRIMITIVE keyword before writing the file. The optional parameter to the QE input writer (rklength.r) is the length parameter (rk-length) that determines the density of the k-point grid. See the KPOINTS keyword.

Tessel (tess)

A tessel input file (extension .tess) can be written. Tessel is a program for graphical representations of crystals, available upon request.

Critic2 (cri, incritic)

A critic2 input file can be written using the .cri or .incritic extensions. The input file contains the description of the molecular or crystal structure using the CRYSTAL/MOLECULE environment.

VASP (POSCAR, CONTCAR, vasp)

A VASP POSCAR (or CONTCAR or file with extension .vasp) is generated by using the POSCAR, CONTCAR, or vasp extension or name. The list of atomic types is written to the critic2 output. This list is necessary to build the corresponding POTCAR. The atoms are always ordered in increasing atomic number.

abinit (abin)

An abinit input file containing the input structure can be written by using the .abin extension.

elk (elk)

An elk input template can be written using the .elk extension.

Crystallographic Information File (cif)

A simple cif file is generated if the .cif extension is used. If NOSYM (or NOSYMM) is used, the cif file is written without symmetry (i.e. in the P1 space group).

SHELX (res)

The .res extension writes a SHELX file in res format. If NOSYM (or NOSYMM) is used, the cif file is written without symmetry (i.e. in the P1 space group).

Crystal (d12)

A template input file for CRYSTAL14 and CRYSTAL17 can be written with the extension .d12. The default behavior is to write a template .d12 file and the geometry details as well as the symmetry information to a file with extension .fort.34 suitable for use with CRYSTAL’s EXTERNAL keyword. This file must be renamed to just fort.34 prior to running CRYSTAL. The .d12 file contains only a keyword to run a geometry test - the input file is complete but there is no basis set specification or any of the other relevant keywords.

Inputting symmetry to CRYSTAL is sometimes problematic because of the assumed crystallographic conventions. If the NOSYM or NOSYMM keyword is used, critic2 writes the structural information without symmetry entirely to the .d12 file (without the fort.34). If the NOEXTERNAL keyword is used, critic2 attempts to write a complete .d12 file (without fort.34) with symmetry. Because of how CRYSTAL interprets the input, this option does not always yield the same structure as the original, so it is strongly recommended that the provided input with the TESTGEOM keyword is run, and the CRYSTAL output is compared with the original structure using the COMPARE keyword.

Crystal17 (and probably earlier versions) has different tolerance values for detecting and accepting symmetry operations than critic2. It is strongly recommenteded that you use SYM REFINE before writing a .d12 file with symmetry in order to recalculcate the atomic positions in the detected space group.

Gaussian (periodic, gau)

A Gaussian input file for calculations under periodic boundary conditions can be written using the .gau extension. For a template corresponding to a finite molecule, use .gjf (see above).

Escher/Octave (m)

The octave script file (extension .m) contains the structure in octave format, to be read using the escher library.

DCP database (db)

The db file format is intended for a set of automated input generation octave scripts, the dcp package.

Gulp (gin)

A simple GULP template input file containing the structure (and EEM as the first line) can be written using the .gin extension. No resonant carbon atoms are detected. The GULP writer is experimental, so please exercise care and double-check the templates.

LAMMPS (lammps)

For file names with an extension .lammps, critic2 writes a simple LAMMPS data file containing one unit cell (length units are angstrom). Only orthogonal cells are supported for now. The LAMMPS writer is experimental, so please exercise care and double-check the templates.

CASTEP (cell)

With a .cell suffix in the file name, critic2 writes a CASTEP .cell input file. An additional optional real number (rklength.r) can be used to choose the size of the k-point grid. See the KPOINTS keyword for details.

SIESTA (fdf, STRUCT_IN)

Two types of siesta inputs can be generated. The .fdf extension writes a template for a proper functional siesta input template containing the crystal structure. The STRUCT_IN extension or name writes files that can be read using the MD.UseStructFile option.

DFTB+ (gen, hsd)

Two inputs types for DFTB+ may be written. The .gen format contains only the structure and is meant to be used with the GenFormat method in Geometry. The .hsd writes a full input template, including the structure.

pyscf (pyscf)

The .pyscf extension writes a template python script for using with pyscf. The script creates an instance of a Mole object (in the case of a molecule) or a Cell object (if it is a crystal).

FHIaims (in)

A geometry.in input file for FHIaims can be generated using the .in extension (note that using scf.in generates QE inputs). The file then needs to be renamed to geometry.in. The appropriate selection of keywords is used depending on whether the current structure is a crystal or a molecule.

If an additional real number is given (rklength.r), an additional file (file.in_control) containing a template control.in for the FHIaims calculation is written. The template control.in specifies a uniform k-point grid in the k_grid keyword with a density given by the length parameter rklength.r. See the KPOINTS keyword for details.

TINKER frac Files (frac)

A .frac file can be generated in TINKER format. The .frac file always contains the cell lengths and angles (even if a molecule is written) and the atomic connectivity calculated by critic2. The force field atom types in the file correspond to the “tiny” force field parameters in the TINKER distribution (tiny.prm).

Protein Data Bank Files (pdb)

The structure can be written in the Protein Data Bank (PDB) file format, with extension .pdb. The atomic positions are written using only the HETATM keyword corresponding to unknown residues. In the case of crystals, the cell vectors are written using the CRYST1 and SCALEn keywords.

Writing a .mols File for DMACRYS/NEIGHCRYS (MAKEMOLSNC)

MAKEMOLSNC file_fort.21.s file_mols.s

The MAKEMOLSNC keyword is used to write .mols files for NEIGHCRYS, the companion program to DMACRYS. DMACRYS is a program for the calculation of lattice energies and their derivatives using rigid molecules and the distributed multipole approach. The atomic multipoles are calculated from a gas-phase molecular wavefunction using the GDMA program

The NEIGHCRYS program is used to help in the preparation of input files for DMACRYS. Sometimes, NEIGHCRYS has difficulty generating the .mols file, a file that contains the local frame of reference for the calculation of the multipoles for every molecule in the crystal. (See also SYM WHOLEMOLS.)

The MAKEMOLSNC keyword is intended to be a replacement for this function in NEIGHCRYS. MAKEMOLSNC reads the NEIGHCRYS output (a fort.21 file) and generates the corresponding mols file with name file_mols.s. MAKEMOLSNC does not work if the molecules are linear because NEIGHCRYS does not work with them either. The use of MAKEMOLSNC does not require a molecular or crystal structure being loaded.

Updated: