Anyone know this Python data structure?
The Python class has six requirements as listed below. Only bold terms are to be read as requirements.
- Close to O(1) performance for as many of the following four operations.
- Maintaining sorted order while inserting an object into the container.
- Ability to peek at last value (the largest value) contained in the object.
- Allowing for pops on both sides (getting the smallest or largest values).
- Capability of getting the total size or number of objects being stored.
- Being a ready made solution like the code in Python's standard library.
What follows is left here for historical reasons (to help the curious and to show that research was conducted).
After looking through Python's Standard Library (specifically the section on Data Types), I still have not found a class that fulfills the requirements of a fragmentation table. collections.deque is close to what is required, but it does not support keeping the data contained in it sorted. It provides:
- Efficient append and pops on either side of a deque with O(1) performance.
- Pops on both sides for the data contained within the object.
- Getting the total size or count of objects contained within.
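As a small illustration of the three points above, here is a sketch using only collections.deque. The values are inserted in order by hand; the deque itself does nothing to keep them sorted.

```python
from collections import deque

d = deque([1, 2, 3, 4])

d.append(5)              # add on the right end
d.appendleft(0)          # add on the left end

smallest = d.popleft()   # remove from the left end
largest = d.pop()        # remove from the right end

print(smallest, largest, len(d), list(d))  # 0 5 4 [1, 2, 3, 4]
```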
Implementing an inefficient solution using lists would be trivial, but finding a class that performs well would be far more desirable. In a growing memory simulation with no upper limit, such a class could keep indexes of empty (deleted) cells and keep fragmentation levels down. The bisect module may help:
- Helps keep an array in sorted order while inserting new objects in array.
- Ready made solution for keeping lists sorted as objects are added.
- Would allow executing array[-1] to peek at last value in the array.
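A minimal sketch of the bisect approach described in the bullets above, using a plain list. Note that bisect.insort finds the slot with a binary search, but the insertion into the list itself is still O(n), so this does not reach the O(1) target.

```python
import bisect

array = []
for value in [5, 1, 4, 2, 3]:
    # Binary search for the insertion point, then insert into the list.
    bisect.insort(array, value)

print(array)      # [1, 2, 3, 4, 5]
print(array[-1])  # 5, the largest value, peeked without removing it
```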
The final candidate, which failed to fully satisfy the requirements and appeared least promising, was the heapq module. While it supports what looked like efficient insertions and guarantees that array[0] is the smallest value, the array is not always in a fully sorted state. Nothing else was found to be as helpful.
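To make the heapq limitation concrete, here is a short sketch. The smallest value is always at index 0, but the underlying list is only heap-ordered, so there is no cheap way to peek at or pop the largest value.

```python
import heapq

heap = []
for value in [5, 1, 4, 2, 3]:
    heapq.heappush(heap, value)

smallest = heap[0]                 # always the minimum
is_sorted = heap == sorted(heap)   # heap order is not sorted order

print(heap)       # [1, 2, 4, 5, 3]
print(smallest)   # 1
print(is_sorted)  # False

# Popping repeatedly does yield the values in ascending order.
drained = [heapq.heappop(heap) for _ in range(len(heap))]
print(drained)    # [1, 2, 3, 4, 5]
```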
Does anyone know of a class or data structure in Python that comes close to these six requirements?
Your requirements seem to be:
- O(1) pop from each end
- Efficient len
- Sorted order
- Peek at last value
for which you can use a deque with a custom insert method which rotates the deque, appends to one end, and unrotates.
>>> from collections import deque
>>> import bisect
>>> class FunkyDeque(deque):
...     def _insert(self, index, value):
...         self.rotate(-index)
...         self.appendleft(value)
...         self.rotate(index)
...
...     def insert(self, value):
...         self._insert(bisect.bisect_left(self, value), value)
...
...     def __init__(self, iterable):
...         super(FunkyDeque, self).__init__(sorted(iterable))
...
>>> foo = FunkyDeque([3,2,1])
>>> foo
deque([1, 2, 3])
>>> foo.insert(2.5)
>>> foo
deque([1, 2, 2.5, 3])
Notice that requirements 1, 2, and 4 all follow directly from the fact that the underlying data structure is a deque, and requirement 3 holds because of the way data is inserted. (Note of course that you could bypass the sorting requirement by calling e.g. _insert, but that's beside the point.)
Many thanks go out to katrielalex for providing the inspiration that led to the following Python class:
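import collections
import bisect

class FastTable:

    def __init__(self):
        self.__deque = collections.deque()

    def __len__(self):
        return len(self.__deque)

    def head(self):
        # Remove and return the smallest value.
        return self.__deque.popleft()

    def tail(self):
        # Remove and return the largest value.
        return self.__deque.pop()

    def peek(self):
        # Return the largest value without removing it.
        return self.__deque[-1]

    def insert(self, obj):
        # Find the sorted position, rotate it to the front,
        # add the object there, then rotate back.
        index = bisect.bisect_left(self.__deque, obj)
        self.__deque.rotate(-index)
        self.__deque.appendleft(obj)
        self.__deque.rotate(index)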
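The rotate, appendleft, rotate sequence used by insert can be checked in isolation. The following is only a sketch: insert_sorted is a hypothetical helper name, not part of the class above, and it compares the result against sorted() on random input. Keep in mind that rotate costs time proportional to the number of steps, so an insertion near the middle is O(n) rather than O(1); only the pops, the peek, and len are constant time.

```python
import bisect
import random
from collections import deque

def insert_sorted(d, value):
    """Hypothetical helper: insert value into an already sorted deque."""
    index = bisect.bisect_left(d, value)  # position that keeps d sorted
    d.rotate(-index)                      # bring that position to the front
    d.appendleft(value)                   # place the value there
    d.rotate(index)                       # restore the original alignment

random.seed(0)
values = [random.randrange(100) for _ in range(50)]

table = deque()
for v in values:
    insert_sorted(table, v)

assert list(table) == sorted(values)   # order is maintained
assert table[-1] == max(values)        # peek at the largest value
assert table.popleft() == min(values)  # pop the smallest value
assert table.pop() == max(values)      # pop the largest value
assert len(table) == len(values) - 2
```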
blist.sortedlist
- Close to O(1) performance for as many of the following four operations.
- Maintaining sorted order while inserting an object into the container.
- Ability to peek at last value (the largest value) contained in the object.
- Allowing for pops on both sides (getting the smallest or largest values).
- Capability of getting the total size or number of objects being stored.
- Being a ready made solution like the code in Python's standard library.
It's a B+ Tree.