Open In App

Introduction to Searching – Data Structure and Algorithm Tutorial

Last Updated : 23 Apr, 2024
Improve
Improve
Like Article
Like
Save
Share
Report

Searching is the fundamental process of locating a specific element or item within a collection of data. This collection of data can take various forms, such as arrays, lists, trees, or other structured representations.

Searching-algorithm
Introduction to Searching – Data Structure and Algorithm Tutorial

The primary objective of searching is to determine whether the desired element exists within the data, and if so, to identify its precise location or retrieve it. It plays an important role in various computational tasks and real-world applications, including information retrieval, data analysis, decision-making processes, and more.

Importance of Searching in DSA

  • Efficiency: Efficient searching algorithms improve program performance.
  • Data Retrieval: Quickly find and retrieve specific data from large datasets.
  • Database Systems: Enables fast querying of databases.
  • Problem Solving: Used in a wide range of problem-solving tasks.

Characteristics of Searching

Understanding the characteristics of searching in data structures and algorithms is crucial for designing efficient algorithms and making informed decisions about which searching technique to employ. Here, we explore key aspects and characteristics associated with searching:

1. Target Element:

In searching, there is always a specific target element or item that you want to find within the data collection. This target could be a value, a record, a key, or any other data entity of interest.

2. Search Space:

The search space refers to the entire collection of data within which you are looking for the target element. Depending on the data structure used, the search space may vary in size and organization.

3. Complexity:

Searching can have different levels of complexity depending on the data structure and the algorithm used. The complexity is often measured in terms of time and space requirements.

4. Deterministic vs Non-deterministic:

Some searching algorithms, like binary search, are deterministic, meaning they follow a clear and systematic approach. Others, such as linear search, are non-deterministic, as they may need to examine the entire search space in the worst case.

Searching Algorithms:

Searching Algorithms are designed to check for an element or retrieve an element from any data structure where it is stored.

Below are some searching algorithms:

  1. Linear Search
  2. Binary Search
  3. Ternary Search
  4. Jump Search
  5. Interpolation Search
  6. Fibonacci Search
  7. Exponential Search

1. Linear Search:

Linear Search, also known as Sequential Search, is one of the simplest and most straightforward searching algorithms. It works by sequentially examining each element in a collection of data(array or list) until a match is found or the entire collection has been traversed.

Linear-Search
Linear Search

Algorithm of Linear Search:

  • The Algorithm examines each element, one by one, in the collection, treating each element as a potential match for the key you’re searching for.
  • If it finds any element that is exactly the same as the key you’re looking for, the search is successful, and it returns the index of key.
  • If it goes through all the elements and none of them matches the key, then that means “No match is Found”.

Illustration of Linear Search:

Consider the array arr[] = {10, 50, 30, 70, 80, 20, 90, 40} and key = 30

Start from the first element (index 0) and compare key with each element (arr[i]). Comparing key with first element arr[0]. Since not equal, the iterator moves to the next element as a potential match.

Linear-Search-Algorithm-1

Comparing key with next element arr[1]. Since not equal, the iterator moves to the next element as a potential match.

Linear-Search-Algorithm-2

Now when comparing arr[2] with key, the value matches. So the Linear Search Algorithm will yield a successful message and return the index of the element when key is found.

Linear-Search-Algorithm-3

Pseudo Code for Linear Search:

LinearSearch(collection, key):

for each element in collection:

if element is equal to key:

return the index of the element

return “Not found”

Complexity Analysis of Linear Search:

  • Time Complexity:
    • Best Case: In the best case, the key might be present at the first index. So the best case complexity is O(1)
    • Worst Case: In the worst case, the key might be present at the last index i.e., opposite to the end from which the search has started in the list. So the worst-case complexity is O(N) where N is the size of the list.
    • Average Case: O(N)
  • Auxiliary Space: O(1) as except for the variable to iterate through the list, no other variable is used.

When to use Linear Search:

  • When there is small collection of data.
  • When data is unordered.

2. Binary Search:

Binary Search is defined as a searching algorithm used in a sorted array by repeatedly dividing the search interval in half. The idea of binary search is to use the information that the array is sorted and reduce the time complexity to O(log N).

Binary-Seach-1
Binary Search Algorithm

Algorithm of Binary Search:

  • Divide the search space into two halves by finding the middle index “mid”.
  • Compare the middle element of the search space with the key.
  • If the key is found at middle element, the process is terminated.
  • If the key is not found at middle element, choose which half will be used as the next search space.
    • If the key is smaller than the middle element, then the left side is used for next search.
    • If the key is larger than the middle element, then the right side is used for next search.
  • This process is continued until the key is found or the total search space is exhausted.

Illustration of Binary Search:

Consider an array arr[] = {2, 5, 8, 12, 16, 23, 38, 56, 72, 91}, and the target = 23.

  • Calculate the mid and compare the mid element with the key. If the key is less than mid element, move to left and if it is greater than the mid then move search space to the right.
  • Key (i.e., 23) is greater than current mid element (i.e., 16). The search space moves to the right.

Binary-Seach-Algorithm-1

  • Key is less than the current mid 56. The search space moves to the left.

Binary-Seach-Algorithm-2

  • If the key matches the value of the mid element, the element is found and stop search.

Binary-Seach-Algorithm-3

Pseudo Code for Binary Search:

Below is the pseudo code for implementing binary search:

binarySearch(collection, key):

left = 0

right = length(collection) – 1

while left <= right:

mid = (left + right) // 2

if collection[mid] == key:

return mid

elif collection[mid] < key:

left = mid + 1

else:

right = mid – 1

return “Not found”

Complexity Analysis of Binary Search:

  • Time Complexity:
    • Best Case: O(1) – When the key is found at the middle element.
    • Worst Case: O(log N) – When the key is not present, and the search space is continuously halved.
    • Average Case: O(log N)
  • Auxiliary Space: O(1)

When to use Binary Search:

  • When the data collection is monotonic (essential condition) in nature.
  • When efficiency is required, specially in case of large datasets.

3. Ternary Search:

Ternary Search is a searching algorithm that divides the search space into three parts instead of two, as in Binary Search. It is very useful in the case of unimodal functions.

Algorithm Ternary Search:

  • In Ternary Search, start with two midpoints, oneThird and twoThirds, which divide the collection into three roughly equal parts.
  • Compare the elements at oneThird and twoThirds with the target key you’re searching for.
  • Three Possibilities:
    • If oneThird contains the key, you’re done and return the index of oneThird.
    • If twoThirds contains the key, you’re done and return the index of twoThirds.
    • If the key is less than the element at oneThird, eliminate the rightmost one-third of the collection and focus on the left two-thirds.
  • If the key is greater than the element at twoThirds, eliminate the leftmost one-third of the collection and focus on the right two-thirds.
  • Repeat this process iteratively until either key is found or determine that it’s not present in the collection.

Example of Ternary Search:

Consider an array arr[] = {1, 2, 3, 4, 5, 6, 7, 8, 9, 10}, and the target = 6.

Ternary-Search
Ternary Search

Complexity Analysis of Ternary Search:

  • Time Complexity:
    • Best Case: O(1)
    • Worst Case: O(log3N)
    • Average Case: O(log3N)
  • Auxiliary Space: O(1)

4. Jump Search:

Jump Search is another searching algorithm that can be used on sorted collections (arrays or lists). The idea is to reduce the number of comparisons by jumping ahead by fixed steps or skipping some elements in place of searching all elements.

Illustration of Jump Search:

Let’s consider the following array: (0, 1, 1, 2, 3, 5, 8, 13, 21, 34, 55, 89, 144, 233, 377, 610).

The length of the array is 16. The Jump search will find the value of 55 with the following steps assuming that the block size to be jumped is 4.

  • Jump from index 0 to index 4;
  • Jump from index 4 to index 8;
  • Jump from index 8 to index 12;
  • Since the element at index 12 is greater than 55, we will jump back a step to come to index 8.
  • Perform a linear search from index 8 to get the element 55.

Time Complexity of Jump Search:

  • Time Complexity: O(√n), where “n” is the number of elements in the collection. This makes it more efficient than Linear Search but generally less efficient than Binary Search for large datasets.
  • Auxiliary Space: O(1), as it uses a constant amount of additional space for variables.

Performance Comparison based on Complexity:

linear search < jump search < binary search

5. Interpolation Search

Interpolation Search is an efficient searching algorithm for sorted collections of data, such as arrays or lists. It is an improvement over Binary Search, particularly when the data is uniformly distributed.

6. Fibonacci Search

Fibonacci Search is an efficient searching algorithm used for finding a target value in a sorted collection, such as an array or list. It is similar in principle to Binary Search but uses Fibonacci numbers to determine the positions to be compared.

7. Exponential Search

Exponential Search is a searching algorithm designed to find a target value in a sorted collection, such as an array or list. It combines elements of Binary Search and Linear Search to efficiently locate the target, especially when its position is near the beginning of the collection.

Easy Problems on Searching:

  1. Count 1’s in a sorted binary array
  2. Ceiling in a sorted array
  3. k largest(or smallest) elements in an array
  4. Kth smallest element in a row-wise and column-wise sorted 2D array
  5. Given an array of of size n and a number k, find all elements that appear more than n/k times.

Medium problems on Searching:

  1. Find a peak element
  2. Search an element in a sorted and rotated array
  3. Find the minimum element in a sorted and rotated array
  4. Find the closest pair from two sorted arrays
  5. Allocate Minimum Number of Pages from N books to M students
  6. Assign stalls to K cows to maximize the minimum distance between them

Hard problems on Searching:

  1. Median of two sorted arrays
  2. Median of two sorted arrays of different sizes
  3. Search in an almost sorted array
  4. Find position of an element in a sorted array of infinite numbers
  5. Given a sorted and rotated array, find if there is a pair with a given sum
  6. Longest Increasing Subsequence Size (N log N)



Similar Reads

Importance of searching in Data Structure
Searching is a fundamental operation in data structures that involves finding a specific piece of data within a collection. It is crucial for efficiently retrieving information from a dataset, especially when dealing with large amounts of data. Importance of Searching in Data Structures:Searching is a fundamental operation in data structures. It al
3 min read
Static Data Structure vs Dynamic Data Structure
Data structure is a way of storing and organizing data efficiently such that the required operations on them can be performed be efficient with respect to time as well as memory. Simply, Data Structure are used to reduce complexity (mostly the time complexity) of the code. Data structures can be two types : 1. Static Data Structure 2. Dynamic Data
4 min read
Real time optimized KMP Algorithm for Pattern Searching
In the article, we have already discussed the KMP algorithm for pattern searching. In this article, a real-time optimized KMP algorithm is discussed. From the previous article, it is known that KMP(a.k.a. Knuth-Morris-Pratt) algorithm preprocesses the pattern P and constructs a failure function F(also called as lps[]) to store the length of the lon
7 min read
Rabin-Karp algorithm for Pattern Searching in Matrix
Given matrices txt[][] of dimensions m1 x m2 and pattern pat[][] of dimensions n1 x n2, the task is to check whether a pattern exists in the matrix or not, and if yes then print the top most indices of the pat[][] in txt[][]. It is assumed that m1, m2 ? n1, n2 Examples: Input: txt[][] = {{G, H, I, P} {J, K, L, Q} {R, G, H, I} {S, J, K, L} } pat[][]
15+ min read
Introduction to Universal Hashing in Data Structure
Universal hashing is a technique used in computer science and information theory for designing hash functions. It is a family of hash functions that can be efficiently computed by using a randomly selected hash function from a set of hash functions. The goal of universal hashing is to minimize the chance of collisions between distinct keys, which c
5 min read
Introduction to Graph Data Structure
Graph Data Structure is a non-linear data structure consisting of vertices and edges. It is useful in fields such as social network analysis, recommendation systems, and computer networks. In the field of sports data science, graph data structure can be used to analyze and understand the dynamics of team performance and player interactions on the f
15+ min read
Introduction to Tree Data Structure
Tree data structure is a specialized data structure to store data in hierarchical manner. It is used to organize and store data in the computer to be used more effectively. It consists of a central node, structural nodes, and sub-nodes, which are connected via edges. We can also say that tree data structure has roots, branches, and leaves connected
15+ min read
Data Structure Alignment : How data is arranged and accessed in Computer Memory?
Data structure alignment is the way data is arranged and accessed in computer memory. Data alignment and Data structure padding are two different issues but are related to each other and together known as Data Structure alignment. Data alignment: Data alignment means putting the data in memory at an address equal to some multiple of the word size.
4 min read
Difference between data type and data structure
Data Type A data type is the most basic and the most common classification of data. It is this through which the compiler gets to know the form or the type of information that will be used throughout the code. So basically data type is a type of information transmitted between the programmer and the compiler where the programmer informs the compile
4 min read
Difference between Greedy Algorithm and Divide and Conquer Algorithm
Greedy algorithm and divide and conquer algorithm are two common algorithmic paradigms used to solve problems. The main difference between them lies in their approach to solving problems. Greedy Algorithm:The greedy algorithm is an algorithmic paradigm that follows the problem-solving heuristic of making the locally optimal choice at each stage wit
3 min read