pyphot.io package#
IO module#
This module provides functions for reading and writing data in various formats.
It is adapted from SimpleTable (v2.0; mfouesneau/simpletable) and has minimal dependencies.
For all formats, reading and writing preserve metadata.
The data are given as pd.DataFrame objects and the header as a HeaderInfo object.
- from_file(fname, *, format=None, **kwargs)[source]#
Read a file into a DataFrame and its header information (HeaderInfo).
- Parameters:
fname (str) – File name to read.
format (str, optional) – File format to read. If not provided, the format is inferred from the file extension.
**kwargs – Additional keyword arguments to pass to the reader.
- Returns:
df (pd.DataFrame) – DataFrame containing the data.
hdr (HeaderInfo) – Header information.
- Return type:
tuple[DataFrame, HeaderInfo]
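A minimal usage sketch for from_file (the file name is illustrative; the import assumes from_file is exported by pyphot.io as documented above):
```python
from pyphot.io import from_file

# Format inferred from the extension; "table.csv" is an illustrative file name.
df, hdr = from_file("table.csv")
print(df.head())
print(hdr.units)  # column -> unit mapping extracted from the header
```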
Submodules#
pyphot.io.ascii module#
Export dataframe to ASCII format while preserving attrs
- ascii_generate_header(df, comments='#', delimiter=' ', commented_header=True)[source]#
Generate the corresponding ascii Header that contains all necessary info
- Parameters:
df (pd.DataFrame) – table to export
comments (str) – string to prepend header lines
delimiter (str, optional) – The string used to separate values. By default, this is any whitespace.
commented_header (bool, optional) – if set, the last line of the header is expected to be the column titles
- Returns:
hdr – string that will be written at the beginning of the file
- Return type:
str
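A short sketch for ascii_generate_header, assuming the submodule import path above and that column units live in df.attrs (an assumed attrs layout, not confirmed here):
```python
import pandas as pd
from pyphot.io.ascii import ascii_generate_header

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
df.attrs["units"] = {"wavelength": "nm", "flux": "Jy"}  # assumed attrs layout

header_block = ascii_generate_header(df, comments="#", delimiter=" ", commented_header=True)
print(header_block)  # commented lines ending with the column titles
```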
- ascii_read_header(fname, *, commentchar='#', delimiter=',', commented_header=True, **kwargs)[source]#
Read ASCII/CSV header
- Parameters:
fname (str, FilePath, BaseBuffer) – File, filename, or generator to read. Note that generators should return byte strings.
commentchar (str, optional) – The character used to indicate the start of a comment; default: '#'. ('' is equivalent to None)
delimiter (str, optional) – The string used to separate values. By default, this is any whitespace.
commented_header (bool, optional) – if set, the last line of the header is expected to be the column titles (with comment character); otherwise, the first line of the data will be the column titles
- Returns:
nlines (int) – number of lines from the header
info (HeaderInfo) – header information (header, alias, units, comments)
names (List[str]) – first data line after the header, expected to contain the column names.
- Return type:
Tuple[int, HeaderInfo, List[str]]
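A usage sketch for ascii_read_header on a hypothetical commented-header CSV file:
```python
from pyphot.io.ascii import ascii_read_header

nlines, info, names = ascii_read_header(
    "table.csv", commentchar="#", delimiter=",", commented_header=True
)
print(nlines, names)   # number of header lines and the column names
print(info.comments)   # per-column descriptions, if present in the header
```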
- from_ascii(filepath_or_buffer, *, commented_header=False, **kwargs)[source]#
Read an ASCII file into a DataFrame.
Same as from_csv with the delimiter set to " " (a single space) by default.
- Parameters:
filepath_or_buffer (str, path object or file-like object) – Any valid string path is acceptable. The string could be a URL. Valid URL schemes include http, ftp, s3, and file. For file URLs, a host is expected. A local file could be: file://localhost/path/to/table.csv.
commented_header (bool, default False) – Whether the header is commented or not.
**kwargs (dict) – Additional keyword arguments passed to pd.read_csv.
- Returns:
DataFrame (pd.DataFrame) – The parsed data as a pd.DataFrame.
header (HeaderInfo) – The header information extracted from the file.
See also
from_csv – Read a CSV file into a DataFrame
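A sketch for from_ascii on a whitespace-separated table with a commented header (file name illustrative):
```python
from pyphot.io.ascii import from_ascii

df, hdr = from_ascii("table.dat", commented_header=True)
print(df.columns.tolist(), hdr.units)
```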
- from_csv(filepath_or_buffer, *, commented_header=False, **kwargs)[source]#
Read a CSV file into a DataFrame while preserving header information
Equivalent to pd.read_csv with preserved header information.
Also supports optionally iterating over the file or breaking it into chunks.
Additional help can be found in the online docs for IO Tools.
- Parameters:
filepath_or_buffer (str, path object or file-like object) – Any valid string path is acceptable.
commented_header (bool, default False) – Whether the column definition header line starts with a comment character.
commentchar (str, default '#') – Character to treat as a comment character.
sep (str, default ',') – Character or regex pattern to treat as the delimiter. If sep=None, the C engine cannot automatically detect the separator, but the Python parsing engine can, meaning the latter will be used and automatically detect the separator from only the first valid row of the file by Python's builtin sniffer tool, csv.Sniffer. In addition, separators longer than 1 character and different from '\s+' will be interpreted as regular expressions and will also force the use of the Python parsing engine. Note that regex delimiters are prone to ignoring quoted data. Regex example: '\r\t'.
- Returns:
DataFrame (pd.DataFrame) – The parsed data as a pd.DataFrame.
header (HeaderInfo) – The header information extracted from the file.
See also
pd.read_csv – Read a CSV file into a DataFrame
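A sketch for from_csv reading a plain CSV and a semicolon-separated variant (file names illustrative; extra keywords are forwarded to pd.read_csv):
```python
from pyphot.io.ascii import from_csv

df, hdr = from_csv("table.csv")                                      # header line not commented
df2, hdr2 = from_csv("table2.csv", commented_header=True, sep=";")   # commented header, ';' delimiter
```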
- to_ascii(self, filepath_or_buffer, *, sep=' ', commentchar='#', **kwargs)[source]#
Write object to an ASCII values file while preserving attrs
Equivalent to to_csv with default sep set to a space.
See also
to_csv – Write object to a CSV file while preserving attrs
- Parameters:
self (DataFrame)
filepath_or_buffer (str | PathLike[str] | BaseBuffer)
sep (str)
commentchar (str)
- Return type:
str | None
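A sketch calling to_ascii as a plain function with the DataFrame as first argument, per the signature above (output file name illustrative; index=False is forwarded to to_csv):
```python
import pandas as pd
from pyphot.io.ascii import to_ascii

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
to_ascii(df, "table.dat", sep=" ", commentchar="#", index=False)
```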
- to_csv(self, filepath_or_buffer, *, sep=',', commentchar='#', na_rep='', float_format=None, columns=None, header=True, index=True, index_label=None, mode='w', encoding=None, compression='infer', quoting=None, quotechar='"', lineterminator=None, chunksize=None, date_format=None, doublequote=True, escapechar=None, decimal='.', errors='strict', storage_options=None)[source]#
Write object to a comma-separated values (csv) file while preserving attrs
Falls back to pd.DataFrame.to_csv if there is no attrs content.
- Parameters:
path_or_buf (str, path object, file-like object, or None, default None) – String, path object (implementing os.PathLike[str]), or file-like object implementing a write() function. If None, the result is returned as a string. If a non-binary file object is passed, it should be opened with newline=’’, disabling universal newlines. If a binary file object is passed, mode might need to contain a ‘b’.
sep (str, default ',') – String of length 1. Field delimiter for the output file.
commentchar (str, default '#') – Character starting a comment line for the output file.
na_rep (str, default '') – Missing data representation.
float_format (str, Callable, default None) – Format string for floating point numbers. If a Callable is given, it takes precedence over other numeric formatting parameters, like decimal.
columns (sequence, optional) – Columns to write.
header (bool or list of str, default True) – Write out the column names. If a list of strings is given it is assumed to be aliases for the column names.
index (bool, default True) – Write row names (index).
index_label (str or sequence, or False, default None) – Column label for index column(s) if desired. If None is given, and header and index are True, then the index names are used. A sequence should be given if the object uses MultiIndex. If False do not print fields for index names. Use index_label=False for easier importing in R.
mode ({'w', 'x', 'a'}, default 'w') –
Forwarded to either open(mode=) or fsspec.open(mode=) to control the file opening. Typical values include:
'w', truncate the file first.
'x', exclusive creation, failing if the file already exists.
'a', append to the end of file if it exists.
encoding (str, optional) – A string representing the encoding to use in the output file, defaults to ‘utf-8’. encoding is not supported if path_or_buf is a non-binary file object.
compression (str or dict, default 'infer') –
For on-the-fly compression of the output data. If 'infer' and 'path_or_buf' is path-like, then detect compression from the following extensions: '.gz', '.bz2', '.zip', '.xz', '.zst', '.tar', '.tar.gz', '.tar.xz' or '.tar.bz2' (otherwise no compression). Set to None for no compression. Can also be a dict with key 'method' set to one of {'zip', 'gzip', 'bz2', 'zstd', 'xz', 'tar'}; other key-value pairs are forwarded to zipfile.ZipFile, gzip.GzipFile, bz2.BZ2File, zstandard.ZstdCompressor, lzma.LZMAFile or tarfile.TarFile, respectively. As an example, the following could be passed for faster compression and to create a reproducible gzip archive: compression={'method': 'gzip', 'compresslevel': 1, 'mtime': 1}. May also be a dict with key 'method' as compression mode and other entries as additional compression options if compression mode is 'zip'.
Passing compression options as keys in dict is supported for compression modes 'gzip', 'bz2', 'zstd', and 'zip'.
quoting (optional constant from csv module) – Defaults to csv.QUOTE_MINIMAL. If you have set a float_format then floats are converted to strings and thus csv.QUOTE_NONNUMERIC will treat them as non-numeric.
quotechar (str, default '"') – String of length 1. Character used to quote fields.
lineterminator (str, optional) – The newline character or character sequence to use in the output file. Defaults to os.linesep, which depends on the OS in which this method is called (e.g. '\n' for Linux, '\r\n' for Windows).
chunksize (int or None) – Rows to write at a time.
date_format (str, default None) – Format string for datetime objects.
doublequote (bool, default True) – Control quoting of quotechar inside a field.
escapechar (str, default None) – String of length 1. Character used to escape sep and quotechar when appropriate.
decimal (str, default '.') – Character recognized as decimal separator. E.g. use ‘,’ for European data.
errors (str, default 'strict') – Specifies how encoding and decoding errors are to be handled. See the errors argument for open() for a full list of options.
storage_options (dict, optional) – Extra options that make sense for a particular storage connection, e.g. host, port, username, password, etc. For HTTP(S) URLs the key-value pairs are forwarded to urllib.request.Request as header options. For other URLs (e.g. starting with "s3://" and "gcs://") the key-value pairs are forwarded to fsspec.open. Please see fsspec and urllib for more details.
self (DataFrame)
filepath_or_buffer (str | PathLike[str] | BaseBuffer)
- Returns:
If path_or_buf is None, returns the resulting csv format as a string. Otherwise returns None.
- Return type:
None or str
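A sketch for to_csv writing a DataFrame whose attrs carry per-column comments (the attrs layout is an assumption; the output name is illustrative):
```python
import pandas as pd
from pyphot.io.ascii import to_csv

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
df.attrs["comments"] = {"flux": "observed flux"}  # assumed attrs layout

# attrs content is serialized into commented header lines before the data.
to_csv(df, "table.csv", sep=",", commentchar="#", index=False)
```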
pyphot.io.fits module#
Module for reading and writing FITS files
Important
This module relies on astropy.io.fits
- fits_generate_hdu(df, index=True)[source]#
Generate a FITS BinTableHDU from a DataFrame.
- Parameters:
df (pd.DataFrame) – The DataFrame to convert.
index (bool, optional) – Whether to include the index in the table, by default True.
- Returns:
The generated HDU.
- Return type:
BinTableHDU
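A sketch for fits_generate_hdu on a small DataFrame (requires astropy; the data are made up):
```python
import pandas as pd
from pyphot.io.fits import fits_generate_hdu

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
hdu = fits_generate_hdu(df, index=False)  # astropy BinTableHDU
print(hdu.columns)
```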
- fits_generate_header(df)[source]#
Generate the corresponding fits Header that contains all necessary info
- Parameters:
df (pd.DataFrame) – DataFrame or HeaderInfo instance
- Returns:
hdr – header instance
- Return type:
fits.Header
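A sketch for fits_generate_header alone (requires astropy; the data are made up):
```python
import pandas as pd
from pyphot.io.fits import fits_generate_header

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
hdr = fits_generate_header(df)  # astropy fits.Header built from the DataFrame and its attrs
print(repr(hdr))
```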
- fits_read_header(hdr)[source]#
Convert an astropy.io.fits header into a HeaderInfo with the relevant values
- Parameters:
hdr (fits.Header) – FITS header unit
- Returns:
headerinfo – extracted information from header
- Return type:
HeaderInfo
- fix_endian_issue(arr)[source]#
Fix the byte-order (endian) issue in an array, which often occurs when reading FITS files
- Parameters:
arr (ndarray[tuple[Any, ...], dtype[_ScalarT]] | Any)
- Return type:
ndarray[tuple[Any, …], dtype[_ScalarT]]
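A sketch for fix_endian_issue with a big-endian array, as typically returned when reading FITS data:
```python
import numpy as np
from pyphot.io.fits import fix_endian_issue

big_endian = np.arange(3, dtype=">f8")   # explicit big-endian dtype
native = fix_endian_issue(big_endian)    # same values in a pandas-friendly byte order
```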
- from_fits(filename, extension_number=1)[source]#
Load a DataFrame from a FITS file.
- Parameters:
filename (str) – The path to the FITS file.
extension_number (int, optional) – The extension number to load, by default 1.
- Returns:
The loaded DataFrame and its header information.
- Return type:
Tuple[npt.NDArray, HeaderInfo]
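A sketch for from_fits loading the binary table in extension 1 (file name hypothetical):
```python
from pyphot.io.fits import from_fits

data, hdr = from_fits("photometry.fits", extension_number=1)
print(hdr.header)   # keyword/value pairs from the FITS header
```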
- to_fits(df, filename, extension_number=1, header_info=None, output_verify='exception', checksum=False, index=True, overwrite=False, append=False, **kwargs)[source]#
Save a DataFrame to a FITS file.
- Parameters:
df (pd.DataFrame) – The DataFrame to save. Header information is taken from df.attrs, or from header_info if provided.
filename (str) – The path to the FITS file.
extension_number (int, optional) – The extension number to save, by default 1.
header_info (Optional[HeaderInfo], optional) – Header information to save with the FITS file. By default None, in which case it is taken from df.attrs; overrides df.attrs if provided.
output_verify (str) – Output verification option. Must be one of "fix", "silentfix", "ignore", "warn", or "exception". May also be any combination of "fix" or "silentfix" with "+ignore", "+warn", or "+exception" (e.g. "fix+warn").
checksum (bool, optional) – If True, adds both DATASUM and CHECKSUM cards to the headers of all HDUs written to the file.
index (bool, optional) – If True, includes the index in the FITS file. Default is True.
append (bool, optional) – If True, appends the DataFrame to the FITS file. Default is False.
overwrite (bool, optional) – If True, overwrites the DataFrame in the FITS file. Default is False.
**kwargs (dict) – Additional keyword arguments to pass to the FITS writer.
- Return type:
None
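A sketch for to_fits writing a small DataFrame to a new FITS file (file name illustrative):
```python
import pandas as pd
from pyphot.io.fits import to_fits

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
to_fits(df, "photometry.fits", extension_number=1, index=False, overwrite=True)
```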
pyphot.io.hdf module#
Read and write HDF5 files with PyTables (tables, https://www.pytables.org/) while preserving metadata.
Important
This module relies on pytables
- from_hdf5(filename, tablename=None, *, silent=True, **kwargs)[source]#
Read an HDF5 table into a DataFrame and its header information.
- Parameters:
filename (str) – file to read from
tablename (str) – node containing the table
silent (bool) – skip verbose messages
- Returns:
df (pd.DataFrame) – DataFrame containing the data.
hdr (HeaderInfo) – Header information.
- Return type:
Tuple[DataFrame, HeaderInfo]
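A sketch for from_hdf5, assuming the reader follows the package convention of returning a DataFrame and a HeaderInfo (file and node names are hypothetical):
```python
from pyphot.io.hdf import from_hdf5

df, hdr = from_hdf5("library.hdf5", tablename="/data")
```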
- to_hdf5(df, filename, *, tablename=None, header_info=None, mode='w', append=False, **kwargs)[source]#
Write a pandas DataFrame to an HDF5 file.
- Parameters:
df (pd.DataFrame) – The DataFrame to write.
filename (str or tables.File or PathLike) – The filename or open HDF5 file to write to.
tablename (str, optional) – The name of the table to write to.
header_info (HeaderInfo, optional) – The header information to write. Defaults to the content of df.attrs.
mode ({'r', 'w', 'a', 'r+'}, default 'w') – The mode to open the file in.
append (bool, default False) – Whether to append data to an existing file.
**kwargs – Additional keyword arguments to pass to tables.open_file.
- Raises:
Exception – If the HDF backend does not implement stream.
tables.FileModeError – If the file is already opened in a different mode.
ValueError – If something went wrong and pytables did not provide further information.
- Return type:
None
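A sketch for to_hdf5 writing a small table to a new HDF5 file (file and node names are hypothetical; requires pytables):
```python
import pandas as pd
from pyphot.io.hdf import to_hdf5

df = pd.DataFrame({"wavelength": [1.0, 2.0], "flux": [3.0, 4.0]})
to_hdf5(df, "library.hdf5", tablename="/data", mode="w")
```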
pyphot.io.header module#
Defines the HeaderInfo class that contains the metadata of a file.
- class HeaderInfo[source]#
Bases: object
Extracted information from FITS header
- __init__(header, alias, units, comments)#
- Parameters:
header (Dict[Hashable, Any])
alias (Dict[Hashable, str])
units (Dict[Hashable, str])
comments (Dict[Hashable, str])
- Return type:
None
- alias: Dict[Hashable, str]#
Alias dictionary which contains potential mappings of data columns to aliases
- comments: Dict[Hashable, str]#
Comments/description dictionary containing potential mappings of data columns to comments
- header: Dict[Hashable, Any]#
Header dictionary containing any metadata from a file input
- units: Dict[Hashable, str]#
Units dictionary containing potential mappings of data columns to units
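A sketch constructing a HeaderInfo by hand with the four dictionaries described above (all values illustrative):
```python
from pyphot.io.header import HeaderInfo

info = HeaderInfo(
    header={"NAME": "example table"},
    alias={"wl": "wavelength"},
    units={"wavelength": "nm", "flux": "Jy"},
    comments={"flux": "observed flux"},
)
print(info.units["flux"])  # 'Jy'
```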
pyphot.io.votable module#
VOTable parser for astronomical tabular data.
VOTable is the standard XML format for astronomical tabular data. This module implements a custom VOTableParser that relies only on XML parsing, with no additional dependencies. from_votable provides the standard interface of the io operations (pandas.DataFrame, HeaderInfo).
- class VOTableParser[source]#
Bases: object
A custom VOTable parser using XML parsing.
VOTable is the standard XML format for astronomical tabular data. This class parses the structure and extracts the data.
Initialize VOTable parser
- Parameters:
source (str or bytes) – Either a file path, URL, or XML string/bytes
is_url (bool) – If True, treat source as URL to fetch
- __init__(source, is_url=False)[source]#
Initialize VOTable parser
- Parameters:
source (str or bytes) – Either a file path, URL, or XML string/bytes
is_url (bool) – If True, treat source as URL to fetch
- from_votable(fname, *, table_index=0, is_url=False)[source]#
Read a VOTable file and return a pandas DataFrame and header information.
- Parameters:
fname (str, bytes, IOBase, PathLike) – The filename or file-like object to read.
table_index (int, optional) – The index of the table to read, by default 0.
is_url (bool, optional) – Whether the file is a URL, by default False.
- Returns:
A tuple containing the pandas DataFrame and header information.
- Return type:
Tuple[pd.DataFrame, HeaderInfo]
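A sketch for from_votable reading the first table of a local VOTable (file name hypothetical; set is_url=True to fetch a remote resource instead):
```python
from pyphot.io.votable import from_votable

df, hdr = from_votable("catalog.vot", table_index=0)
print(df.shape, hdr.units)
```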