Added TTLHeap and fixed readme and docs

This commit is contained in:
nocturn9x 2020-05-31 18:55:16 +00:00
parent 98340973b2
commit ddb60115f8
4 changed files with 147 additions and 187 deletions

171
README.md
View File

@ -2,178 +2,17 @@
`ttlcollections` is a pure Python implementation of a various data structures with built-in TTL (time to live) functionality, using nothing but the standard library.
It takes advantages of the efficient `collections.deque` data structure and of the `heapq` library for fast and efficient queues
## Installation
To install the library from source simply clone this repository and run the command `python3 setup.py install`
To install the library from source, simply clone this repository and run the command `python3 setup.py install`
## Usage
The library will be published on PyPi soon
As of now `ttlcollections` implements 2 data structures: a queue and a stack.
## Docs
The difference between a stack and a queue is that the former follows the LIFO (Last-in First-out) scheme, while the latter follows the FIFO (First-in First out) scheme.
These 2 collections are implemented using `collections.deque` for fast `O(1)` complexity when accessing elements trough `pop`/`get` and `push`/`put`
### Example - TTLQueue
```python
from ttlcollections import TTLQueue
import time
q = TTLQueue(ttl=60) # Elements that are older than 60 seconds will be deleted
q.put(1)
time.sleep(60)
q.get() # Will raise ttlcollections.errors.QueueEmpty because the TTL for 1 has expired
```
#### Methods
This is the methods documentation for TTLQueue in Sphinx format
##### `TTLQueue` - `__init__()`
A FIFO data structure with per-item time to live (TTL)
All items will have a default time to live, after that has
expired (on the next mutating operation a.k.a put or get)
expired elements will be popped out automatically.
It is also possible to set a different TTL for every item and to
define the maximum queue size.
__Note__: This queue is __NOT__ thread safe and must be properly locked
when used with multiple threads
```
:param qsize: The max size of the queue, defaults to 0 (no limit)
:type qsize: int, optional
:param ttl: The TTL for every item in the queue, defaults to 0 (no TTL)
:type ttl: int, optional
:param timer: The timer function that the queue will use to
keep track of elapsed time. Defaults to time.monotonic(), but can
be customized. Any function that yields an incremental value
on each subsequent call is acceptable, but its return values
should not be repeated during runtime to avoid nonsense results
:type timer: class: FunctionType, optional
```
##### `TTLQueue` - `put()`
Pops expired element out of the queue if their TTL has expired by when units of time (usually seconds)
```
:param when: The expiry date to check items against. Items' whose
insertion date, according to self.timer, is less or equal
than this number will be automatically deleted
:type when: int
```
##### `TTLQueue` - `get()`
Puts an item onto the queue
```
:param element: The element to put in the queue
:type element: object
:param ttl: If you want to override the default ttl
of the class for a specific element, you can specify
that, defaults to 0 (use the default TTL)
:param ttl: int, optional
:raises QueueFull: If the queue is full
```
##### `TTLQueue` - `expire()`
Pops expired element out of the queue if their TTL has expired by `when` units of time (usually seconds)
```
:param when: The expiry date to check items against. Items' whose
insertion date, according to self.timer, is less or equal
than this number will be automatically deleted
:type when: int
```
### Example - TTLStack
```python
from ttlcollections import TTLStack
import time
q = TTLStack(ttl=60) # Elements that are older than 60 seconds will be deleted
q.push(1)
time.sleep(60)
q.pop() # Will raise ttlcollections.errors.StackEmpty because the TTL for 1 has expired
```
#### Methods
This is the methods documentation for TTLStack in Sphinx format
##### `TTLStack` - `__init__()`
A stack-like (LIFO) data structure with per-item time to live (TTL).
All items will have a default time to live, after that has
expired (on the next mutating operation a.k.a push or pop)
elements will be popped out automatically.
It is also possible to set a different TTL for every item and to define the maximum stack size
**Note**: This stack is __NOT__ thread safe and must be properly locked when used with multiple threads
```
:param size: The max size of the stack, defaults to 0 (no limit)
:type size: int, optional
:param ttl: The TTL for every item in the stack, defaults to 0 (no TTL)
:type ttl: int, optional
:param timer: The timer function that the stack will use to
keep track of elapsed time. Defaults to time.monotonic(), but can
be customized. Any function that yields an incremental value
on each subsequent call is acceptable, but its return values
should not be repeated during runtime to avoid nonsense results
:type timer: class: FunctionType, optional
```
##### `TTLStack` - `push()`
Pushes an item onto the stack
```
:param element: The element to push
:type element: object
:param ttl: If you want to override the default ttl
of the class for a specific element, you can specify
that, defaults to 0 (use the default TTL)
:param ttl: int, optional
:raises StackFull: If the stack is full
```
##### `TTLStack` - `pop()`
Pops an item from the stack, raising StackEmpty if the stack is empty
```
:raises StackEmpty: If the stack is empty
```
##### `TTLStack` - `expire()`
Pops expired element out of the stack if their TTL has expired by `when` units of time (usually seconds)
```
:param when: The expiry date to check items against.
Items' whose insertion date, according to self.timer, is less or equal than this number
will be automatically deleted
:type when: int
```
#### Notes
Please consider that the TTL expiration check is done at every mutating operation (e.g. `put` or `pop`) and that already expired elements may take space in memory until one of these methods is called.
You can force the TTL expiration check by calling the `expire` method. This method takes a single parameter and will delete all items in the collection that have expired the by the current time (according to the timer function) plus the specified amount of time units (usually seconds). For the current time you can use the collection's internal timer function (e.g. `TTLQueue.timer()`)
The official documentation will be available soon
## Credits

View File

@ -1,5 +1,5 @@
from .structures import TTLQueue, TTLStack
from .structures import TTLQueue, TTLStack, TTLHeap
from .errors import QueueEmpty, StackEmpty, QueueFull, StackFull
__all__ = ["TTLQueue", "TTLStack", "QueueEmpty", "QueueFull", "StackEmpty", "StackFull"]
__all__ = ["TTLQueue", "TTLStack", "TTLHeap", "QueueEmpty", "QueueFull", "StackEmpty", "StackFull"]
__version__ = (1, 0, 0)

23
ttlcollections/objects.py Normal file
View File

@ -0,0 +1,23 @@
class TTLItem(object):
"""An abstraction layer over a queue item. This class is meant
to be used internally and should not be instantiated directly
:param obj: The object to be wrapped
:type obj: object
:param date: The expiration date
:type date: float
"""
def __init__(self, obj: object, date: float):
"""Object constructor"""
self.obj = obj
self.date = date
def __lt__(self, other):
if isinstance(other, TTLItem):
other = other.obj
return self.obj.__lt__(other)

View File

@ -3,14 +3,18 @@ from time import monotonic
from types import FunctionType
import math
from .errors import QueueEmpty, QueueFull, StackEmpty, StackFull
from heapq import heappush, heappop
from .objects import TTLItem
class TTLQueue:
"""A FIFO data structure with per-item time to live (TTL)
All items will have a default time to live, after that has
expired (on the next mutating operation a.k.a put or get)
expired elements will be popped out automatically.
"""A queue (FIFO) with per-item time to live (TTL)
When a TTL expires, its associated element will be deleted, but please
note that TTL expiration (and therefore, items deletion) is performed
only when doing mutating operations on the queue itself (put and get)
It is also possible to set a different TTL for every item and to
define the maximum queue size
@ -51,10 +55,10 @@ class TTLQueue:
n = len(self._queue)
while i < n:
try:
date, element = self._queue[i]
item = self._queue[i]
except IndexError:
break
if date <= when:
if item.date <= when:
del self._queue[i]
i += 1
@ -65,7 +69,7 @@ class TTLQueue:
:type element: object
:param ttl: If you want to override the default ttl
of the class for a specific element, you can specify
that, defaults to 0 (use the default TTL)
that, defaults to 0 (use self.ttl)
:param ttl: int, optional
:raises QueueFull: If the queue is full
"""
@ -73,7 +77,7 @@ class TTLQueue:
ttl = ttl if ttl else self.ttl
self.expire(self.timer())
if len(self._queue) < self.qsize:
self._queue.append((self.timer() + ttl, element))
self._queue.append(TTLItem(element, ttl + self.timer()))
else:
raise QueueFull("The queue is full!")
@ -85,29 +89,33 @@ class TTLQueue:
self.expire(self.timer())
if not self._queue:
raise QueueEmpty("The queue is empty!")
return self._queue.popleft()[1]
return self._queue.popleft().obj
def __repr__(self):
"""Implements repr(self)"""
string = "TTLQueue({list}, qsize={qsize}, ttl={ttl}, timer={timer})"
values = [t[1] for t in self._queue]
values = [t.obj for t in self._queue]
return string.format(list=values, qsize=self.qsize, ttl=self.ttl, timer=self.timer)
def __iter__(self):
"""Implements iter(self)"""
for _, element in self._queue:
yield element
for element in self._queue:
yield element.obj
class TTLStack:
"""A stack-like (LIFO) data structure with per-item time to live (TTL)
All items will have a default time to live, after that has
expired (on the next mutating operation a.k.a push or pop)
expired elements will be popped out automatically.
"""A stack (LIFO) with per-item time to live (TTL)
All items inside the stack will be associated to a TTL (time to live).
When a TTL expires, its associated element will be deleted, but please
note that TTL expiration (and therefore, items deletion) is performed
only when doing mutating operations on the stack itself (push and pop)
It is also possible to set a different TTL for every item and to
define the maximum stack
define the maximum stack size
Note: This stack is NOT thread safe and must be properly locked
when used with multiple threads
@ -139,7 +147,7 @@ class TTLStack:
:type element: object
:param ttl: If you want to override the default ttl
of the class for a specific element, you can specify
that, defaults to 0 (use the default TTL)
that, defaults to 0 (use self.ttl)
:param ttl: int, optional
:raises StackFull: If the stack is full
"""
@ -182,7 +190,7 @@ class TTLStack:
i = 0
n = len(self._stack)
while i < n:
while i < n: # Using a for loop would raise RuntimeError
try:
date, element = self._stack[i]
except IndexError:
@ -190,3 +198,93 @@ class TTLStack:
if date <= when:
del self._stack[i]
i += 1
class TTLHeap(TTLQueue):
"""A heap queue with per-item time to live (TTL)
All items inside the queue will be associated to a TTL (time to live).
When a TTL expires, its associated element will be deleted, but please
note that TTL expiration (and therefore, items deletion) is performed
only when doing mutating operations on the queue itself (put and get)
It is also possible to set a different TTL for every item and to
define the maximum queue size
Note: This queue is NOT thread safe and must be properly locked
when used with multiple threads
:param qsize: The max size of the queue, defaults to 0 (no limit)
:type qsize: int, optional
:param ttl: The TTL for every item in the queue, defaults to 0 (no TTL)
:type ttl: int, optional
:param timer: The timer function that the queue will use to
keep track of elapsed time. Defaults to time.monotonic(), but can
be customized. Any function that yields an incremental value
on each subsequent call is acceptable, but its return values
should not be repeated during runtime to avoid nonsense results
:type timer: class: FunctionType, optional
"""
def __init__(self, qsize: int = 0, ttl: int = 0, timer: FunctionType = monotonic):
"""Object constructor"""
super().__init__(qsize, ttl, timer)
self._queue = []
def __iter__(self):
"""Implements iter(self)"""
super().__iter__()
def __repr__(self):
"""Implements repr(self)"""
string = "TTLHeap({list}, qsize={qsize}, ttl={ttl}, timer={timer})"
values = [t.obj for t in self._queue]
return string.format(list=values, qsize=self.qsize, ttl=self.ttl, timer=self.timer)
def put(self, element, ttl: int = 0):
"""Puts an item onto the queue
:param element: The element to put in the queue
:type element: object
:param ttl: If you want to override the default ttl
of the class for a specific element, you can specify
that, defaults to 0 (use self.ttl)
:param ttl: int, optional
:raises QueueFull: If the queue is full
"""
ttl = ttl if ttl else self.ttl
self.expire(self.timer())
if len(self._queue) < self.qsize:
heappush(self._queue, TTLItem(element, self.timer() + ttl))
else:
raise QueueFull("The queue is full!")
def get(self):
"""Gets an item from the queue, raising QueueEmpty if the
queue is empty
"""
self.expire(self.timer())
if not self._queue:
raise QueueEmpty("The queue is empty!")
return heappop(self._queue).obj
def expire(self, when: int):
"""Pops expired element out of the queue if their TTL has
expired by when units of time (usually seconds)
:param when: The expiry date to check items against. Items' whose
insertion date, according to self.timer, is less or equal
than this number will be automatically deleted
:type when: int
"""
for item in self._queue:
if item.date <= when:
self._queue.remove(item)