In Python, sets are a built-in data type that represents an unordered collection of unique elements. Sets are mutable, meaning you can add or remove elements, but they do not allow duplicate values. They are particularly useful for operations involving membership testing, removing duplicates from a sequence, and computing mathematical operations like union, intersection, and difference.
Characteristics of Sets
- Unordered: The elements in a set do not have a specific order. As a result, sets do not support indexing, slicing, or other sequence-like behavior.
- Mutable: Sets can be modified after creation by adding or removing elements.
- Unique Elements: Sets automatically eliminate duplicate elements.
- Unhashable Elements Prohibited: Elements of a set must be hashable, meaning mutable data types like lists or dictionaries cannot be elements of a set.
Creating Sets
Sets can be created using curly braces {} or the set() constructor.
# Using curly braces
fruits = {"apple", "banana", "cherry"}
# Using set() constructor
numbers = set([1, 2, 3, 4, 5])
# An empty set (note: {} creates an empty dictionary, not a set)
empty_set = set()
Adding and Removing Elements
- Adding Elements: Use the add() method to add a single element and update() to add multiple elements.
fruits.add("orange") # {"apple", "banana", "cherry", "orange"}
fruits.update(["mango", "grape"]) # {"apple", "banana", "cherry", "orange", "mango", "grape"}
Removing Elements:
- remove(): Removes an element; raises a KeyError if the element is not present.
fruits.remove("banana") # {"apple", "cherry", "orange", "mango", "grape"}
- discard(): Removes an element if present; does nothing if the element is not present.
fruits.discard("banana") # No error, even if "banana" is not in the set
- pop(): Removes and returns an arbitrary element; raises a KeyError if the set is empty.
item = fruits.pop() # Removes an arbitrary element
- clear(): Removes all elements from the set.
fruits.clear() # Now fruits is an empty set
Set Operations
- Union: Combines all elements from both sets (duplicates are removed).
set1 = {1, 2, 3}
set2 = {3, 4, 5}
union_set = set1 | set2 # {1, 2, 3, 4, 5}
Intersection: Contains only elements that are in both sets.
intersection_set = set1 & set2 # {3}
Difference: Contains elements that are in the first set but not in the second.
difference_set = set1 - set2 # {1, 2}
Symmetric Difference: Contains elements that are in either of the sets but not in both.
symmetric_difference_set = set1 ^ set2 # {1, 2, 4, 5}
Other Set Methods
- copy(): Returns a shallow copy of the set.
copy_set = set1.copy()
- isdisjoint(): Returns True if two sets have no elements in common.
is_disjoint = set1.isdisjoint(set2) # False
- issubset(): Returns True if all elements of the set are in another set.
is_subset = {1, 2}.issubset(set1) # True
- issuperset(): Returns True if all elements of another set are in the set.
is_superset = set1.issuperset({1, 2}) # True
Frozen Sets
A frozenset is an immutable version of a set. Once created, elements cannot be added or removed, making it hashable and thus usable as a key in dictionaries or an element of another set.
frozen = frozenset([1, 2, 3])
Use Cases of Sets
1.Removing Duplicates: Sets automatically remove duplicates, making them useful for filtering unique items from a collection.
unique_numbers = set([1, 2, 2, 3, 4, 4, 5]) # {1, 2, 3, 4, 5}
2. Membership Testing: Checking for the presence of an element in a set is faster than in a list due to the underlying hash table structure.
has_apple = "apple" in fruits # True
3.Mathematical Set Operations: Useful for operations like union, intersection, and difference, often used in data science, databases, and more.
4.Data Integrity: Ensuring a collection of unique elements, like in scenarios requiring unique IDs, usernames, etc.
Sets are a versatile and powerful data type in Python, particularly useful for managing collections of unique elements and performing efficient membership checks. Their built-in operations make them ideal for tasks involving set theory and data deduplication. Understanding how to use sets effectively can significantly simplify many programming tasks that involve data collection and manipulation.
'Python > Data type' 카테고리의 다른 글
Python Data type - dictionary (0) | 2024.08.03 |
---|---|
Python Data type - tuple (0) | 2024.08.03 |
Python Data type - list (0) | 2024.07.30 |
Python data type - String (0) | 2024.07.30 |
Python data types - number (0) | 2024.07.30 |