Data Models

This section covers the model classes that represent Athena query executions, session status, and database and table metadata, along with the constants for file formats and compression.

Query Execution

class pyathena.model.AthenaQueryExecution(response: Dict[str, Any])

Represents an Athena query execution with status and metadata.

This class encapsulates information about a query execution in Amazon Athena, including its current state, statistics, error information, and result metadata. It’s primarily used internally by PyAthena cursors but can be useful for monitoring and debugging query execution.

Query States:

QUEUED: Query is waiting to be executed
RUNNING: Query is currently executing
SUCCEEDED: Query completed successfully
FAILED: Query execution failed
CANCELLED: Query was cancelled

Statement Types:

DDL: Data Definition Language (CREATE, DROP, ALTER)
DML: Data Manipulation Language (SELECT, INSERT, UPDATE, DELETE)
UTILITY: Utility statements (SHOW, DESCRIBE, EXPLAIN)

Example

>>> # Typically accessed through cursor execution
>>> cursor.execute("SELECT COUNT(*) FROM my_table")
>>> query_execution = cursor._last_query_execution  # Internal access
>>> print(f"Query ID: {query_execution.query_id}")
>>> print(f"State: {query_execution.state}")
>>> print(f"Data scanned: {query_execution.data_scanned_in_bytes} bytes")
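The state names listed above fall into two groups: states a query can still leave, and states that are final. The following is a minimal sketch of that grouping using plain strings. `TERMINAL_STATES`, `IN_PROGRESS_STATES`, `is_terminal`, and `describe_state` are hypothetical names chosen for illustration and are not part of PyAthena; in practice the string would come from the `state` property shown in the example above.

```python
from typing import Optional

# State names copied from the list above. The grouping is an illustrative
# assumption, not something defined by PyAthena.
TERMINAL_STATES = frozenset({"SUCCEEDED", "FAILED", "CANCELLED"})
IN_PROGRESS_STATES = frozenset({"QUEUED", "RUNNING"})


def is_terminal(state: Optional[str]) -> bool:
    """Return True once a query has reached a final state."""
    return state in TERMINAL_STATES


def describe_state(state: Optional[str]) -> str:
    """Map a state string to a short human-readable summary."""
    if state in IN_PROGRESS_STATES:
        return "in progress"
    if state == "SUCCEEDED":
        return "finished successfully"
    if state in TERMINAL_STATES:
        return "finished without results"
    return "unknown"
```

A monitoring loop could keep polling while `is_terminal` returns False and report `describe_state` once it returns True.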

Session Management

class pyathena.model.AthenaSessionStatus(response: Dict[str, Any])

STATE_CREATING: str = 'CREATING'

STATE_CREATED: str = 'CREATED'

STATE_IDLE: str = 'IDLE'

STATE_BUSY: str = 'BUSY'

STATE_TERMINATING: str = 'TERMINATING'

STATE_TERMINATED: str = 'TERMINATED'

STATE_DEGRADED: str = 'DEGRADED'

STATE_FAILED: str = 'FAILED'

__init__(response: Dict[str, Any]) → None

property session_id: str | None

property state: str | None

property state_change_reason: str | None

property start_date_time: datetime | None

property last_modified_date_time: datetime | None

property end_date_time: datetime | None

property idle_since_date_time: datetime | None
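One way the state constants might be used is to wait for a session to become idle before submitting work. The sketch below works on plain state strings so that it runs without an AWS connection. `wait_until_idle`, `READY_STATE`, and `DEAD_STATES` are hypothetical names, and treating `DEGRADED` as unrecoverable is an assumption made for this example, not documented PyAthena behavior.

```python
import time
from typing import Callable, Optional

# State names copied from the STATE_* constants above.
READY_STATE = "IDLE"
# Assumption: a session in any of these states will not become idle again.
DEAD_STATES = frozenset({"TERMINATING", "TERMINATED", "DEGRADED", "FAILED"})


def wait_until_idle(
    get_state: Callable[[], Optional[str]],
    max_polls: int = 10,
    interval_seconds: float = 0.0,
) -> bool:
    """Poll get_state() until the session reports IDLE.

    Returns True when IDLE is observed and False if max_polls is exhausted.
    Raises RuntimeError if the session reaches one of DEAD_STATES.
    """
    for _ in range(max_polls):
        state = get_state()
        if state == READY_STATE:
            return True
        if state in DEAD_STATES:
            raise RuntimeError(f"session cannot become idle from state {state!r}")
        time.sleep(interval_seconds)
    return False


# Stand-in for reading the `state` property on each poll.
fake_states = iter(["CREATING", "CREATED", "IDLE"])
became_idle = wait_until_idle(lambda: next(fake_states))
```

Passing the state lookup as a callable keeps the polling logic independent of how the status object is fetched, which also makes it easy to test with canned values.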

Database and Table Metadata

class pyathena.model.AthenaDatabase(response)

__init__(response)

property name: str | None

property description: str | None

property parameters: Dict[str, str]

class pyathena.model.AthenaTableMetadata(response)

__init__(response)

property name: str | None

property create_time: datetime | None

property last_access_time: datetime | None

property table_type: str | None

property columns: List[AthenaTableMetadataColumn]

property partition_keys: List[AthenaTableMetadataPartitionKey]

property parameters: Dict[str, str]

property comment: str | None

property location: str | None

property input_format: str | None

property output_format: str | None

property row_format: str | None

property file_format: str | None

property serde_serialization_lib: str | None

property compression: str | None

property serde_properties: Dict[str, str]

property table_properties: Dict[str, str]

File Formats and Compression

class pyathena.model.AthenaFileFormat

Constants and utilities for Athena-supported file formats.

This class provides constants for file formats supported by Amazon Athena and utility methods to check format types. These are commonly used when creating tables or configuring UNLOAD operations.

Supported formats:

SEQUENCEFILE: Hadoop SequenceFile format
TEXTFILE: Plain text files (default)
RCFILE: Record Columnar File format
ORC: Optimized Row Columnar format
PARQUET: Apache Parquet columnar format
AVRO: Apache Avro format
ION: Amazon Ion format

Example

>>> from pyathena.model import AthenaFileFormat
>>>
>>> # Check if format is Parquet
>>> if AthenaFileFormat.is_parquet("PARQUET"):
...     print("Using columnar format")
>>>
>>> # Use in UNLOAD operations
>>> format_type = AthenaFileFormat.FILE_FORMAT_PARQUET
>>> sql = f"UNLOAD (...) TO 's3://bucket/path/' WITH (format = '{format_type}')"
>>> cursor.execute(sql)
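When a format name comes from user input or configuration, it can help to normalize and validate it before interpolating it into SQL. The following is a minimal sketch of that idea using only the format names listed above. `SUPPORTED_FORMATS`, `COLUMNAR_FORMATS`, `normalize_format`, and `is_columnar` are hypothetical local names, not PyAthena APIs, and the columnar grouping is inferred from the descriptions in the list.

```python
# Format names copied from the list above. These sets are local stand-ins,
# not the library's own constants.
SUPPORTED_FORMATS = frozenset(
    {"SEQUENCEFILE", "TEXTFILE", "RCFILE", "ORC", "PARQUET", "AVRO", "ION"}
)
# Assumption: grouping based on the "columnar" wording in the list above.
COLUMNAR_FORMATS = frozenset({"RCFILE", "ORC", "PARQUET"})


def normalize_format(name: str) -> str:
    """Upper-case a format name and reject anything not in the list above."""
    normalized = name.strip().upper()
    if normalized not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported file format: {name!r}")
    return normalized


def is_columnar(name: str) -> bool:
    """Return True if the (validated) format is one of the columnar formats."""
    return normalize_format(name) in COLUMNAR_FORMATS
```

Rejecting unknown names early keeps a typo such as `"parqet"` from reaching Athena as part of a malformed statement. Note that a name appearing in this list does not guarantee that every Athena statement accepts it.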