The SEG-Y Format for Geophysical Data


Based on Digital Tape Standards published by the Society of Exploration Geophysicists (SEG)


SEG-Y

The SEG-Y format is one of several tape standards developed by the Society of Exploration Geophysicists (SEG). It is the most common format used for seismic data in the exploration and production industry. However, it was created in 1973 and many different 'modernized' flavors exist.

SEG-Y was designed for storing a single line of seismic data on IBM 9-track tapes attached to IBM mainframe computers. Most of the variations in modern SEG-Y varieties result from trying to overcome these limitations.

Some of the features of SEG-Y which are outdated today include:

The official standard SEG-Y consists of the following components:

As mentioned earlier there are many variations of the standard.


The SEG-Y EBCDIC Reel Header

The EBCDIC reel header is equivalent to 40 IBM punch-cards (EBCDIC? punchcards? Welcome to the 70's, man!). The official layout of these 80-character cards is the EBCDIC equivalent of the following:

The blank spaces in the cards are fill-in-the-blanks. For example, the client's name is intended to go in the space after 'CLIENT' in the first card. Multiple-choice entries like 'SAMPLE CODE' in card 8 are intended to have the appropriate choice (such as 'FLOATING PT') marked with an 'X'.

Cards 21 through 40 are intended for general descriptions such as the data set's processing history.


The SEG-Y Binary Reel Header

The binary reel header contains much information about the data. Much of this information is optional, that is, the entire header is not required to be valid. In fact, none of it is required to be valid, although some fields are strongly recommended.

The 400 bytes contain 2-byte and 4-byte integers in the following layout:

* strongly recommended


The SEG-Y Trace Header

The 240-byte binary trace header consists of 2-byte and 4-byte integers in the following layout:

* strongly recommended


The SEG-Y Trace Data

Seismic data is acquired by generating a loud sound at one location and recording the resulting rumblings at another location.

The source or shot which generates the sound is typically an explosion or vibration at the Earth's surface (land or sea). Each shot is recorded by many receivers. Generally a line of shots is fired. If one line is recorded, the data is a 2D survey, and if more than one line is recorded, the data is a 3D survey.

The object of recording is to infer geological subsurface structure from the strength (amplitude) of the recorded signal at different times in the recording.

A trace begins life as the recording from one receiver. The recording is sampled at some discrete interval, typically around 4 milliseconds, and lasts for some duration, typically 4 or more seconds. After the initial recording, the traces are processed in any number of ways. This processing usually changes the absolute amplitudes such that amplitude units are irrelevant, and only relative amplitudes are significant. Also the trace may reflect a logical ordering different from the original (shot,receiver) pair.

But in the end, seismic data is almost always stored as a sequence of traces, each trace consisting of amplitude samples for one location (physical or logical).


SEG-Y Variations

Many variations of SEG-Y exist, most created to overcome SEG-Y's limitations.

The EBCDIC reel header is usually completely ignored, and when it is used, it may or may not follow the standard template, and it may even be in ASCII format.

The binary reel header is almost completely ignored. None of the fields should be assumed to be correct, although the number of samples per trace and the sample rate usually are. Often, programs that use the SEG-Y format will read values from the binary header by default but allow the user to override the header values.

The trace header contains important information but not always in the locations specified by the standard. To adapt SEG-Y for 3D surveys, a line number field is often added somewhere in the trace header. Programs that use values from the SEG-Y trace header usually allow the user to specify the byte location and length of the values.

The trace data is most often in 32-bit IBM floating point format. Occasionally 32-bit IEEE floating-point format is used.

Although the standard applies only to tapes, SEG-Y has been adapted for storing surveys on disk as well. Disk files have no record marks or file marks, so traditional methods of reading from tapes don't work with files. There are several SEG-Y disk adaptations: a binary file with 3200-byte and 400-byte headers followed by traces, a binary file with just traces, and Fortran sequential-access files which have 3200-byte and 400-byte headers followed by traces but with Fortran record marks separating them. A similar and common format is a flat file of trace data with no reel or trace headers.