fastest way to detect if duplicate entry exists in javascript array?

Question

var arr = ['test0','test2','test0'];

Like the above,there are two identical entries with value "test0",how to check it most efficiently?

What criteria of efficiency? Speed? Code readability? Memory usage? — Anatoliy
– Anatoliy, Commented Oct 14, 2009 at 7:14
Ummm... why was this question edited instead of posted as a new one? I'm reverting to the original, since now the answers make no sense. — nickf
– nickf, Commented Oct 14, 2009 at 7:30
Mask: post a new question, instead of changing this one to a completely different question. — Miles
– Miles, Commented Oct 14, 2009 at 7:41

Guffa · Accepted Answer · 2009-10-14 07:21:55Z

16

If you sort the array, the duplicates are next to each other so that they are easy to find:

arr.sort();
var last = arr[0];
for (var i=1; i<arr.length; i++) {
   if (arr[i] == last) alert('Duplicate : '+last);
   last = arr[i];
}

answered Oct 14, 2009 at 7:21

Guffa

703k111 gold badges760 silver badges1k bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

Tim Down Over a year ago

... assuming your values are all strings or numbers. Sorting will do no good for an arbitrary array of objects.

Skilldrick Over a year ago

If it's an arbitrary array of objects then it's also difficult to check if they're identical.

Guffa Over a year ago

@Tim: Good point. However, if the can't be sorted, they can hardly be compared to look for duplicates...

Tim Down Over a year ago

You can check whether any array values are identical using the identity operator (===). Sorting an array of objects to bring duplicates to the start requires knowledge of the types of objects being compared in order to decide how to compare them.

Tim Down · Accepted Answer · 2009-10-14 11:28:50Z

6

This will do the job on any array and is probably about as optimized as possible for handling the general case (finding a duplicate in any possible array). For more specific cases (e.g. arrays containing only strings) you could do better than this.

function hasDuplicate(arr) {
    var i = arr.length, j, val;

    while (i--) {
        val = arr[i];
        j = i;
        while (j--) {
            if (arr[j] === val) {
                return true;
            }
        }
    }
    return false;
}

edited Oct 14, 2009 at 11:28

answered Oct 14, 2009 at 10:32

Tim Down

326k76 gold badges461 silver badges546 bronze badges

Comments

Dylan Watson · Accepted Answer · 2019-04-19 09:16:57Z

6

There are lots of answers here but not all of them "feel" nice... So I'll throw my hat in.

If you are using lodash:

function containsDuplicates(array) {
  return _.uniq(array).length !== array.length; 
}

If you can use ES6 Sets, it simply becomes:

function containsDuplicates(array) {
  return array.length !== new Set(array).size
}

With vanilla javascript:

function containsDuplicates(array) {
  return array
    .sort()
    .some(function (item, i, items) {
      return item === items[i + 1]
    })
}

However, sometimes you may want to check if the items are duplicated on a certain field.

This is how I'd handle that:

containsDuplicates([{country: 'AU'}, {country: 'UK'}, {country: 'AU'}], 'country')

function containsDuplicates(array, attribute) {
  return array
    .map(function (item) { return item[attribute] })
    .sort()
    .some(function (item, i, items) {
      return item === items[i + 1]
    })
}

edited Apr 19, 2019 at 9:16

answered Nov 8, 2017 at 2:52

Dylan Watson

2,3232 gold badges20 silver badges38 bronze badges

2 Comments

Himanshu Shekhar Over a year ago

Small Typo. Instead of new Set(array).length it should be new Set(array).size. So function will be containsDuplicates(array) { return array.length === new Set(array).size }

tnsaturday Over a year ago

Have you performed perfomance testing on your solutions? They are in no means "fastest", especially the one using sort and some.

Anatoliy · Accepted Answer · 2009-10-14 08:44:42Z

3

Loop stops when found first duplicate:

function has_duplicates(arr) {

    var x = {}, len = arr.length;
    for (var i = 0; i < len; i++) {
        if (x[arr[i]]) {
             return true;
        }
        x[arr[i]] = true;
    }
    return false;

}

Edit (fix 'toString' issue):

function has_duplicates(arr) {

    var x = {}, len = arr.length;
    for (var i = 0; i < len; i++) {
        if (x[arr[i]] === true) {
             return true;
        }
        x[arr[i]] = true;
    }
    return false;

}

this will correct for case has_duplicates(['toString']); etc..

edited Oct 14, 2009 at 8:44

answered Oct 14, 2009 at 7:18

Anatoliy

30.2k5 gold badges48 silver badges47 bronze badges

4 Comments

bobince Over a year ago

Careful: using Object as a map has difficulties. Keys may only be strings, and names that are members of Object will confuse it. eg. has_duplicates(['toString']) is true.

Tim Down Over a year ago

As bobince says. All keys will be converted to strings. So the above would give a false positive with the following: [ {a: 1}, {b: 2} ] since both members of the array will become "[object Object]" when being stored as keys in x.

Tim Down Over a year ago

Gah, I hate SO on this. I downvoted and commented on your original answer and now cannot remove it.

Anatoliy Over a year ago

In original question array contains only strings. You are correct -- my solition is not working for all possible types, but there is other task.

djechlin · Accepted Answer · 2015-12-01 19:37:10Z

1

Sorting is O(n log n) and not O(n). Building a hash map is O(n). It costs more memory than an in-place sort but you asked for the "fastest." (I'm positive this can be optimized but it is optimal up to a constant factor.)

function hasDuplicate(arr) {
  var hash = {};
  var hasDuplicate = false;
   arr.forEach(function(val) {
     if (hash[val]) {
       hasDuplicate = true;
       return;
     }
     hash[val] = true;
  });
  return hasDuplicate;
}

answered Dec 1, 2015 at 19:37

djechlin

61.1k40 gold badges173 silver badges300 bronze badges

Comments

Anoop Pete · Accepted Answer · 2016-05-24 16:50:02Z

1

    var index = myArray.indexOf(strElement);
    if (index < 0) {
        myArray.push(strElement);
        console.log("Added Into Array" + strElement);
    } else {
        console.log("Already Exists at " + index);
    }

answered May 24, 2016 at 16:50

Anoop Pete

4932 gold badges5 silver badges18 bronze badges

1 Comment

Naman Over a year ago

You shall add the details to why and how is your method faster for a brief explanation.

Sushanth -- · Accepted Answer · 2022-10-18 17:31:53Z

1

You can convert the array to to a Set instance, then convert to an array and check if the length is same before and after the conversion.

const hasDuplicates = (array) => {
  const arr = ['test0','test2','test0'];
  const uniqueItems = new Set(array);
  
  return array.length !== uniqueItems.size();
};

console.log(`Has duplicates : ${hasDuplicates(['test0','test2','test0'])}`);
console.log(`Has duplicates : ${hasDuplicates(['test0','test2','test3'])}`);

edited Oct 18, 2022 at 17:31

answered Sep 20, 2018 at 18:36

Sushanth --

55.8k9 gold badges70 silver badges109 bronze badges

2 Comments

Ma Jerez Over a year ago

Set has a size property, you don't need convert it back to array.

Sushanth -- Over a year ago

Thats a good point.

tnsaturday · Accepted Answer · 2022-06-14 13:49:13Z

It depends on the input array size. I've done some performance tests with Node.js performance hooks and found out that for really small arrays (1,000 to 10,000 entries) Set solution might be faster. But if your array is bigger (like 100,000 elements) plain Object (i. e. hash) solution becomes faster. Here's the code so you can try it out for yourself:

const { performance } = require('perf_hooks');

function objectSolution(nums) {
  let testObj = {};
  for (var i = 0; i < nums.length; i++) {
    let aNum = nums[i];
    if (testObj[aNum]) {
      return true;
    } else {
      testObj[aNum] = true;
    }
  }

  return false;
}

function setSolution(nums) {
  let testSet = new Set(nums);
  return testSet.size !== nums.length;
}

function sortSomeSolution(nums) {
  return nums
    .sort()
    .some(function (item, i, items) {
      return item === items[i + 1]
    })
}

function runTest(testFunction, testArray) {
  console.log('   Running test:', testFunction.name);
  let start = performance.now();
  let result = testFunction(testArray);
  let end = performance.now();
  console.log('      Duration:', end - start, 'ms');
}

let arr = [];
let setSize = 100000;
for (var i = 0; i < setSize; i++) {
  arr.push(i);
}

console.log('Set size:', setSize);
runTest(objectSolution, arr);
runTest(setSolution, arr);
runTest(sortSomeSolution, arr);

On my Lenovo IdeaPad with i3-8130U Node.js v. 16.6.2 gives me following results for the array of 1,000:

results for the array of 100,000:

Rodrigo · Accepted Answer · 2015-12-01 19:23:01Z

-1

Assuming all you want is to detect how many duplicates of 'test0' are in the array. I guess an easy way to do that is to use the join method to transform the array in a string, and then use the match method.

var arr= ['test0','test2','test0'];
var str = arr.join();

console.log(str) //"test0,test2,test0"

var duplicates = str.match(/test0/g);
var duplicateNumber = duplicates.length;

console.log(duplicateNumber); //2

answered Dec 1, 2015 at 19:23

Rodrigo

1012 silver badges6 bronze badges

Collectives™ on Stack Overflow

fastest way to detect if duplicate entry exists in javascript array?

9 Answers 9

4 Comments

Comments

2 Comments

4 Comments

Comments

1 Comment

2 Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

9 Answers 9

4 Comments

Comments

2 Comments

4 Comments

Comments

1 Comment

2 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related