Find and list duplicates in a list?

Question

How can I find the duplicates in a Python list and create another list of the duplicates? The list is just integers.

@LevLevitsky Ok, so find one of the 100s of identical Python questions. — agf, Mar 23 '12 at 8:16
possible duplicate of How do you remove duplicates from a list in Python whilst preserving order? — DhruvPathak, Mar 23 '12 at 8:29
do you want the duplicates once, or every time it is seen again? — moooeeeep, Mar 23 '12 at 9:21
Whilst there are many other questions about how to remove duplicates from Python lists, none that I could find demonstrate how to add the duplicates to another list. — MFB, Mar 25 '12 at 22:33
Someone should make a list of those questions and write a script to remove duplicates. — James Bradbury, Aug 7 '14 at 15:00

Community · Accepted Answer · 2017-05-23 10:31:37Z

up vote 203 down vote accepted

To remove duplicates use set(a), to print duplicates - something like

a = [1,2,3,2,1,5,6,5,5,5]

import collections
print [item for item, count in collections.Counter(a).items() if count > 1]

## [1, 2, 5]

Note that Counter is not particularly efficient (timings) and probably an overkill here, set will perform better:

seen = set()
uniq = []
for x in a:
    if x not in seen:
        uniq.append(x)
        seen.add(x)

or, more concisely:

seen = set()
uniq = [x for x in a if x not in seen and not seen.add(x)]

I don't recommend the latter style though.

If list elements are not hashable, you cannot use set/dicts and have to resort to a quadratic time solution (compare each which each), for example:

a = [ [1], [2], [3], [1], [5], [3] ]

no_dupes = [x for n, x in enumerate(a) if x not in a[:n]]
print no_dupes # [[1], [2], [3], [5]]

dupes = [x for n, x in enumerate(a) if x in a[:n]]
print dupes # [[1], [3]]

edited May 23 at 10:31

Community♦

11

answered Mar 23 '12 at 8:05

georg

109k22150227

12

Great answer but NB: needs 2.7 to work – Jura Aug 22 '12 at 19:03

5

The second and third methods only remove duplicates, but don't list them: uniq == [1, 2, 3, 5, 6] and seen == set([1, 2, 3, 5, 6]). – Hugo Dec 16 '15 at 21:57

1

@eric: I guess it's O(n), because it only iterates the list once and set lookups are O(1). – georg Feb 4 '16 at 7:28

2

@Hugo, I corrected the third example – yota Feb 27 '16 at 21:48

2

@Hugo, to see the list of duplicates, we just need to create a new list called dup and add an else statement. For example: dup = [] else: dup.append(x) – Chris Nielsen Apr 29 '16 at 16:45

| show 6 more comments

Ritesh Kumar · Answer 2 · 2013-03-01 10:19:20Z

up vote 162 down vote

>>> l = [1,2,3,4,4,5,5,6,1]
>>> set([x for x in l if l.count(x) > 1])
set([1, 4, 5])

answered Mar 1 '13 at 10:19

Ritesh Kumar

1,629162

5

+1, very simple and works in 2.6 – JohnJ Jun 20 '13 at 20:08

2

Is there any reason you use a list comprehension instead of a generator comprehension? – user3892448 Sep 4 '14 at 13:08

14

Indeed, a simple solution, but complexity is squared because each count() parses the list all over again, so don't use for large lists. – danuker Feb 10 '15 at 15:35

1

@JohnJ, bubble sort is also simple and works. That doesn't mean we should use it! – John La Rooy Dec 17 '15 at 19:51

@JohnLaRooy It actually does mean we should not use it, because there is almost always a more efficient (and simpler) way to do sort. – lostsoul29 Mar 13 at 22:49

add a comment |

Community · Answer 3 · 2017-05-23 12:26:34Z

You don't need the count, just whether or not the item was seen before. Adapted that answer to this problem:

def list_duplicates(seq):
  seen = set()
  seen_add = seen.add
  # adds all elements it doesn't know yet to seen and all other to seen_twice
  seen_twice = set( x for x in seq if x in seen or seen_add(x) )
  # turn the set into a list (as requested)
  return list( seen_twice )

a = [1,2,3,2,1,5,6,5,5,5]
list_duplicates(a) # yields [1, 2, 5]

Just in case speed matters, here are some timings:

# file: test.py
import collections

def thg435(l):
    return [x for x, y in collections.Counter(l).items() if y > 1]

def moooeeeep(l):
    seen = set()
    seen_add = seen.add
    # adds all elements it doesn't know yet to seen and all other to seen_twice
    seen_twice = set( x for x in l if x in seen or seen_add(x) )
    # turn the set into a list (as requested)
    return list( seen_twice )

def RiteshKumar(l):
    return list(set([x for x in l if l.count(x) > 1]))

def JohnLaRooy(L):
    seen = set()
    seen2 = set()
    seen_add = seen.add
    seen2_add = seen2.add
    for item in L:
        if item in seen:
            seen2_add(item)
        else:
            seen_add(item)
    return list(seen2)

l = [1,2,3,2,1,5,6,5,5,5]*100

Here are the results: (well done @JohnLaRooy!)

$ python -mtimeit -s 'import test' 'test.JohnLaRooy(test.l)'
10000 loops, best of 3: 74.6 usec per loop
$ python -mtimeit -s 'import test' 'test.moooeeeep(test.l)'
10000 loops, best of 3: 91.3 usec per loop
$ python -mtimeit -s 'import test' 'test.thg435(test.l)'
1000 loops, best of 3: 266 usec per loop
$ python -mtimeit -s 'import test' 'test.RiteshKumar(test.l)'
100 loops, best of 3: 8.35 msec per loop

Interestingly, besides the timings itself, also the ranking slightly changes when pypy is used. Most interestingly, the Counter-based approach benefits hugely from pypy's optimizations, whereas the method caching approach I have suggested seems to have almost no effect.

$ pypy -mtimeit -s 'import test' 'test.JohnLaRooy(test.l)'
100000 loops, best of 3: 17.8 usec per loop
$ pypy -mtimeit -s 'import test' 'test.thg435(test.l)'
10000 loops, best of 3: 23 usec per loop
$ pypy -mtimeit -s 'import test' 'test.moooeeeep(test.l)'
10000 loops, best of 3: 39.3 usec per loop

Apparantly this effect is related to the "duplicatedness" of the input data. I have set l = [random.randrange(1000000) for i in xrange(10000)] and got these results:

$ pypy -mtimeit -s 'import test' 'test.moooeeeep(test.l)'
1000 loops, best of 3: 495 usec per loop
$ pypy -mtimeit -s 'import test' 'test.JohnLaRooy(test.l)'
1000 loops, best of 3: 499 usec per loop
$ pypy -mtimeit -s 'import test' 'test.thg435(test.l)'
1000 loops, best of 3: 1.68 msec per loop

Just curious - what's the purpose of seen_add = seen.add here? — Jura, Aug 22 '12 at 19:19
@Rob This way you just call the function you've looked up once before. Otherwise you would need to look up (a dictionary query) the member function add every time an insert would be necessary. — moooeeeep, Aug 29 '12 at 20:09
checked with my own data and Ipython's %timeit your method does look fastest on test BUT: "The slowest run took 4.34 times longer than the fastest. This could mean that an intermediate result is being cached" — Joop, May 6 '15 at 7:14
@moooeeeep, I added another version to your script for you to try :) Also try pypy if you have it handy and are going for speed. — John La Rooy, Dec 17 '15 at 20:00
@JohnLaRooy Nice improvement in performance! Interestingly, when I used pypy to evaluate the results, the Counter-based approach improves significantly. — moooeeeep, Dec 18 '15 at 7:28

F1Rumors · Answer 4 · 2015-12-29 22:40:51Z

I came across this question whilst looking in to something related - and wonder why no-one offered a generator based solution? Solving this problem would be:

>>> print list(getDupes_9([1,2,3,2,1,5,6,5,5,5]))
[1, 2, 5]

I was concerned with scalability, so tested several approaches, including naive items that work well on small lists, but scale horribly as lists get larger (note- would have been better to use timeit, but this is illustrative).

I included @moooeeeep for comparison (it is impressively fast: fastest if the input list is completely random) and an itertools approach that is even faster again for mostly sorted lists... Now includes pandas approach from @firelynx -- slow, but not horribly so, and simple. Note - sort/tee/zip approach is consistently fastest on my machine for large mostly ordered lists, moooeeeep is fastest for shuffled lists, but your mileage may vary.

Advantages

very quick simple to test for 'any' duplicates using the same code

Assumptions

Duplicates should be reported once only
Duplicate order does not need to be preserved
Duplicate might be anywhere in the list

Fastest solution, 1m entries:

def getDupes(c):
        '''sort/tee/izip'''
        a, b = itertools.tee(sorted(c))
        next(b, None)
        r = None
        for k, g in itertools.izip(a, b):
            if k != g: continue
            if k != r:
                yield k
                r = k

Approaches tested

import itertools
import time
import random

def getDupes_1(c):
    '''naive'''
    for i in xrange(0, len(c)):
        if c[i] in c[:i]:
            yield c[i]

def getDupes_2(c):
    '''set len change'''
    s = set()
    for i in c:
        l = len(s)
        s.add(i)
        if len(s) == l:
            yield i

def getDupes_3(c):
    '''in dict'''
    d = {}
    for i in c:
        if i in d:
            if d[i]:
                yield i
                d[i] = False
        else:
            d[i] = True

def getDupes_4(c):
    '''in set'''
    s,r = set(),set()
    for i in c:
        if i not in s:
            s.add(i)
        elif i not in r:
            r.add(i)
            yield i

def getDupes_5(c):
    '''sort/adjacent'''
    c = sorted(c)
    r = None
    for i in xrange(1, len(c)):
        if c[i] == c[i - 1]:
            if c[i] != r:
                yield c[i]
                r = c[i]

def getDupes_6(c):
    '''sort/groupby'''
    def multiple(x):
        try:
            x.next()
            x.next()
            return True
        except:
            return False
    for k, g in itertools.ifilter(lambda x: multiple(x[1]), itertools.groupby(sorted(c))):
        yield k

def getDupes_7(c):
    '''sort/zip'''
    c = sorted(c)
    r = None
    for k, g in zip(c[:-1],c[1:]):
        if k == g:
            if k != r:
                yield k
                r = k

def getDupes_8(c):
    '''sort/izip'''
    c = sorted(c)
    r = None
    for k, g in itertools.izip(c[:-1],c[1:]):
        if k == g:
            if k != r:
                yield k
                r = k

def getDupes_9(c):
    '''sort/tee/izip'''
    a, b = itertools.tee(sorted(c))
    next(b, None)
    r = None
    for k, g in itertools.izip(a, b):
        if k != g: continue
        if k != r:
            yield k
            r = k

def getDupes_a(l):
    '''moooeeeep'''
    seen = set()
    seen_add = seen.add
    # adds all elements it doesn't know yet to seen and all other to seen_twice
    for x in l:
        if x in seen or seen_add(x):
            yield x

def getDupes_b(x):
    '''iter*/sorted'''
    x = sorted(x)
    def _matches():
        for k,g in itertools.izip(x[:-1],x[1:]):
            if k == g:
                yield k
    for k, n in itertools.groupby(_matches()):
        yield k

def getDupes_c(a):
    '''pandas'''
    import pandas as pd
    vc = pd.Series(a).value_counts()
    i = vc[vc > 1].index
    for _ in i:
        yield _

def hasDupes(fn,c):
    try:
        if fn(c).next(): return True    # Found a dupe
    except StopIteration:
        pass
    return False

def getDupes(fn,c):
    return list(fn(c))

STABLE = True
if STABLE:
    print 'Finding FIRST then ALL duplicates, single dupe of "nth" placed element in 1m element array'
else:
    print 'Finding FIRST then ALL duplicates, single dupe of "n" included in randomised 1m element array'
for location in (50,250000,500000,750000,999999):
    for test in (getDupes_2, getDupes_3, getDupes_4, getDupes_5, getDupes_6,
                 getDupes_8, getDupes_9, getDupes_a, getDupes_b, getDupes_c):
        print 'Test %-15s:%10d - '%(test.__doc__ or test.__name__,location),
        deltas = []
        for FIRST in (True,False):
            for i in xrange(0, 5):
                c = range(0,1000000)
                if STABLE:
                    c[0] = location
                else:
                    c.append(location)
                    random.shuffle(c)
                start = time.time()
                if FIRST:
                    print '.' if location == test(c).next() else '!',
                else:
                    print '.' if [location] == list(test(c)) else '!',
                deltas.append(time.time()-start)
            print ' -- %0.3f  '%(sum(deltas)/len(deltas)),
        print
    print

The results for the 'all dupes' test were consistent, finding "first" duplicate then "all" duplicates in this array:

Finding FIRST then ALL duplicates, single dupe of "nth" placed element in 1m element array
Test set len change :    500000 -  . . . . .  -- 0.264   . . . . .  -- 0.402  
Test in dict        :    500000 -  . . . . .  -- 0.163   . . . . .  -- 0.250  
Test in set         :    500000 -  . . . . .  -- 0.163   . . . . .  -- 0.249  
Test sort/adjacent  :    500000 -  . . . . .  -- 0.159   . . . . .  -- 0.229  
Test sort/groupby   :    500000 -  . . . . .  -- 0.860   . . . . .  -- 1.286  
Test sort/izip      :    500000 -  . . . . .  -- 0.165   . . . . .  -- 0.229  
Test sort/tee/izip  :    500000 -  . . . . .  -- 0.145   . . . . .  -- 0.206  *
Test moooeeeep      :    500000 -  . . . . .  -- 0.149   . . . . .  -- 0.232  
Test iter*/sorted   :    500000 -  . . . . .  -- 0.160   . . . . .  -- 0.221  
Test pandas         :    500000 -  . . . . .  -- 0.493   . . . . .  -- 0.499

When the lists are shuffled first, the price of the sort becomes apparent - the efficiency drops noticeably and the @moooeeeep approach dominates, with set & dict approaches being similar but lessor performers:

Finding FIRST then ALL duplicates, single dupe of "n" included in randomised 1m element array
Test set len change :    500000 -  . . . . .  -- 0.321   . . . . .  -- 0.473  
Test in dict        :    500000 -  . . . . .  -- 0.285   . . . . .  -- 0.360  
Test in set         :    500000 -  . . . . .  -- 0.309   . . . . .  -- 0.365  
Test sort/adjacent  :    500000 -  . . . . .  -- 0.756   . . . . .  -- 0.823  
Test sort/groupby   :    500000 -  . . . . .  -- 1.459   . . . . .  -- 1.896  
Test sort/izip      :    500000 -  . . . . .  -- 0.786   . . . . .  -- 0.845  
Test sort/tee/izip  :    500000 -  . . . . .  -- 0.743   . . . . .  -- 0.804  
Test moooeeeep      :    500000 -  . . . . .  -- 0.234   . . . . .  -- 0.311  *
Test iter*/sorted   :    500000 -  . . . . .  -- 0.776   . . . . .  -- 0.840  
Test pandas         :    500000 -  . . . . .  -- 0.539   . . . . .  -- 0.540

@moooeeeep - be interested to see your views on the ifilter/izip/tee approach. — F1Rumors, Jul 17 '15 at 16:22
this answer is incredibly good.I don't understand that it didn't had more points for the explanations and tests which are very usefull for those that would need it. — dlewin, Aug 17 '15 at 7:48
python's sort is O(n) when only one item is out of order. You should random.shuffle(c) to account for that. Additionally I can't replicate your results when running the unaltered script either (totally different ordering), so maybe it is dependent on the CPU too. — John La Rooy, Dec 17 '15 at 19:38
Thank you @John-La-Rooy, astute observation on the CPU/local machine being impactful - so I should amend the item YYMV. Using the O(n) sort was deliberate: the duplicative element is inserted at different locations specifically to see the impact of the approach if there is a sole duplicate in a good (start of list) or bad (end of list) location with these approaches. I considered a random list - eg random.shuffle - but decided that would only be sensible if I did a lot more runs! I shall have to return and benchmark a multiple run/shuffle equivalent and see what the impact is. — F1Rumors, Dec 29 '15 at 19:15
Amended to include @firelynx pandas approach & to run on fully shuffled list as well as sorted list. This is because the native timsort used by Python is wicked fast on mostly sorted data (best case) and the shuffled lists are its worst case scenario - that shakes up the results. — F1Rumors, Dec 30 '15 at 16:56

georg · Answer 5 · 2012-03-23 09:29:34Z

up vote 8 down vote

collections.Counter is new in python 2.7:


Python 2.5.4 (r254:67916, May 31 2010, 15:03:39) 
[GCC 4.1.2 20080704 (Red Hat 4.1.2-46)] on linux2
a = [1,2,3,2,1,5,6,5,5,5]
import collections
print [x for x, y in collections.Counter(a).items() if y > 1]
Type "help", "copyright", "credits" or "license" for more information.
  File "", line 1, in 
AttributeError: 'module' object has no attribute 'Counter'
>>>

In an earlier version you can use a conventional dict instead:

a = [1,2,3,2,1,5,6,5,5,5]
d = {}
for elem in a:
    if elem in d:
        d[elem] += 1
    else:
        d[elem] = 1

print [x for x, y in d.items() if y > 1]

edited Mar 23 '12 at 9:29

georg

109k22150227

answered Mar 23 '12 at 8:52

Edward

1023

Given the vague question, I don't think it's a bad answer. – jdborg Oct 29 '12 at 11:05

@JohnLaRooy In what way is the answer you replied to not an answer? – sasha Dec 17 '15 at 12:16

@sasha, when I replied, it was another question - see the edit history @ Mar 23 '12 at 9:00 :) but I will delete that comment now. – John La Rooy Dec 17 '15 at 19:42

add a comment |

firelynx · Answer 6 · 2015-12-09 09:52:43Z

I would do this with pandas, because I use pandas a lot

import pandas as pd
a = [1,2,3,3,3,4,5,6,6,7]
vc = pd.Series(a).value_counts()
vc[vc > 1].index.tolist()

Gives

[3,6]

Probably isn't very efficient, but it sure is less code than a lot of the other answers, so I thought I would contribute

Nikhil Prabhu · Answer 7 · 2016-07-15 17:29:47Z

up vote 4 down vote

Here's a neat and concise solution -

for x in set(li):
    li.remove(x)

li = list(set(li))

answered Jul 15 '16 at 17:29

Nikhil Prabhu

1739

The original list is lost, though. This can be fixed by copying the contents to another list - temp = li[:] – Nikhil Prabhu Jul 15 '16 at 17:32

add a comment |

user3109122 · Answer 8 · 2015-05-27 04:09:56Z

up vote 2 down vote

A bit late, but maybe helpful for some. For a largish list, I found this worked for me.

l=[1,2,3,5,4,1,3,1]
s=set(l)
d=[]
for x in l:
    if x in s:
        s.remove(x)
    else:
        d.append(x)
d
[1,3,1]

Shows just and all duplicates and preserves order.

answered May 27 '15 at 4:09

user3109122

411

Doesn't work with a list of lists. – vanguard69 Jul 7 '15 at 11:04

add a comment |

wordsmith · Answer 9 · 2016-10-08 11:10:45Z

up vote 2 down vote

Using pandas:

>>> import pandas as pd
>>> a = [1, 2, 1, 3, 3, 3, 0]
>>> pd.Series(a)[pd.Series(a).duplicated()].values
array([1, 3, 3])

answered Oct 8 '16 at 11:10

wordsmith

1308

add a comment |

yota · Answer 10 · 2016-02-28 17:51:06Z

the third example of the accepted answer give an erroneous answer and does not attempt to give duplicates. Here is the correct version :

number_lst = [1, 1, 2, 3, 5, ...]

seen_set = set()
duplicate_set = set(x for x in number_lst if x in seen_set or seen_set.add(x))
unique_set = seen_set - duplicate_set

Igor Vishnevskiy · Answer 11 · 2016-06-21 02:32:19Z

Very simple and quick way of finding dupes with one iteration in Python is:

testList = ['red', 'blue', 'red', 'green', 'blue', 'blue']

testListDict = {}

for item in testList:
  try:
    testListDict[item] += 1
  except:
    testListDict[item] = 1

print testListDict

Output will be as follows:

>>> print testListDict
{'blue': 3, 'green': 1, 'red': 2}

This and more in my blog http://www.howtoprogramwithpython.com

Haresh Shyara · Answer 12 · 2015-02-11 12:28:05Z

up vote 0 down vote

list2 = [1, 2, 3, 4, 1, 2, 3]
lset = set()
[(lset.add(item), list2.append(item))
 for item in list2 if item not in lset]
print list(lset)

answered Feb 11 '15 at 12:28

Haresh Shyara

706210

add a comment |

ytpillai · Answer 13 · 2015-07-11 17:10:23Z

up vote 0 down vote

One line solution:

set([i for i in list if sum([1 for a in list if a == i]) > 1])

answered Jul 11 '15 at 17:10

ytpillai

1,320924

Returns a dictionary though – ytpillai Jul 11 '15 at 17:12

not a dictionnary :) a set() – yota Mar 10 '16 at 14:03

Ah yes I was not the smartest back then – ytpillai Mar 10 '16 at 15:01

add a comment |

tvt173 · Answer 14 · 2016-03-10 22:12:12Z

There are a lot of answers up here, but I think this is relatively a very readable and easy to understand approach:

def get_duplicates(sorted_list):
    duplicates = []
    last = sorted_list[0]
    for x in sorted_list[1:]:
        if x == last:
            duplicates.append(x)
        last = x
    return set(duplicates)

Notes:

If you wish to preserve duplication count, get rid of the cast to 'set' at the bottom to get the full list
If you prefer to use generators, replace duplicates.append(x) with yield x and the return statement at the bottom (you can cast to set later)

John B · Answer 15 · 2017-02-12 03:52:55Z

Here's a fast generator that uses a dict to store each element as a key with a boolean value for checking if the duplicate item has already been yielded.

For lists with all elements that are hashable types:

def gen_dupes(array):
    unique = {}
    for value in array:
        if value in unique and unique[value]:
            unique[value] = False
            yield value
        else:
            unique[value] = True

array = [1, 2, 2, 3, 4, 1, 5, 2, 6, 6]
print(list(gen_dupes(array)))
# => [2, 1, 6]

For lists that might contain lists:

def gen_dupes(array):
    unique = {}
    for value in array:
        is_list = False
        if type(value) is list:
            value = tuple(value)
            is_list = True

        if value in unique and unique[value]:
            unique[value] = False
            if is_list:
                value = list(value)

            yield value
        else:
            unique[value] = True

array = [1, 2, 2, [1, 2], 3, 4, [1, 2], 5, 2, 6, 6]
print(list(gen_dupes(array)))
# => [2, [1, 2], 6]

MSeifert · Answer 16 · 2017-04-04 19:39:03Z

You can use iteration_utilities.duplicates:

>>> from iteration_utilities import duplicates

>>> list(duplicates([1,1,2,1,2,3,4,2]))
[1, 1, 2, 2]

or if you only want one of each duplicate this can be combined with iteration_utilities.unique_everseen:

>>> from iteration_utilities import unique_everseen

>>> list(unique_everseen(duplicates([1,1,2,1,2,3,4,2])))
[1, 2]

^{1 This is from a third-party library I have written: iteration_utilities.}

HenryDev · Answer 17 · 2017-05-21 16:40:54Z

How about simply loop through each element in the list by checking the number of occurrences, then adding them to a set which will then print the duplicates. Hope this helps someone out there.

myList  = [2 ,4 , 6, 8, 4, 6, 12];
newList = set()

for i in myList:
    if myList.count(i) >= 2:
        newList.add(i)

print(list(newList))
## [4 , 6]

Matt S · Answer 18 · 2016-02-05 14:36:35Z

this is the way I had to do it because I challenged myself not to use other methods:

def dupList(oldlist):
    if type(oldlist)==type((2,2)):
        oldlist=[x for x in oldlist]
    newList=[]
    newList=newList+oldlist
    oldlist=oldlist
    forbidden=[]
    checkPoint=0
    for i in range(len(oldlist)):
        #print 'start i', i
        if i in forbidden:
            continue
        else:
            for j in range(len(oldlist)):
                #print 'start j', j
                if j in forbidden:
                    continue
                else:
                    #print 'after Else'
                    if i!=j: 
                        #print 'i,j', i,j
                        #print oldlist
                        #print newList
                        if oldlist[j]==oldlist[i]:
                            #print 'oldlist[i],oldlist[j]', oldlist[i],oldlist[j]
                            forbidden.append(j)
                            #print 'forbidden', forbidden
                            del newList[j-checkPoint]
                            #print newList
                            checkPoint=checkPoint+1
    return newList

so your sample works as:

>>>a = [1,2,3,3,3,4,5,6,6,7]
>>>dupList(a)
[1, 2, 3, 4, 5, 6, 7]

This is not what the OP wanted. He wanted a list of the duplicates, not a list with the duplicates removed. To make a list with the duplicates removed, I would suggest duplist = list(set(a)). — zondo, Feb 5 '16 at 14:56

Carpetsmoker · Answer 19 · 2017-01-03 20:54:09Z

up vote -2 down vote

Use the sort() function. Duplicates can be identified by looping over it and checking l1[i] == l1[i+1].

edited Jan 3 at 20:54

Carpetsmoker

13.8k124568

answered Feb 10 '16 at 15:25

user5909101

1

add a comment |

asked	5 years, 2 months ago
viewed	220,541 times
active	19 days ago

Find and list duplicates in a list?

19 Answers 19

protected by Community♦ Feb 17 '16 at 12:10

Not the answer you're looking for? Browse other questions tagged python list duplicates or ask your own question.

Linked

Hot Network Questions

Find and list duplicates in a list?

19 Answers 19

protected by Community♦ Feb 17 '16 at 12:10

Not the answer you're looking for? Browse other questions tagged python list duplicates or ask your own question.

Linked

Related

Hot Network Questions