Dear All,
Thank you all very much for your feedback! Actually, the number of
collisions didn't decrease when I increased the bit length, though
increasing radius to 3 did help a bit. Overall, it is good to know that
great results are not to be expected.
Best wishes,
Michal

On Wed, 10 Oct 2018 at 13:31, Chris Earnshaw <cgearns...@gmail.com> wrote:

> Hi
>
> It sounds to me like you're already getting better results than you could
> reasonably expect.
>
> Prediction of melting point is a phenomenally difficult thing to do;
> you're trying to find the temperature at which a (generally undefined)
> solid crystalline phase is in equilibrium with a (probably even less
> defined) liquid phase. You also need to consider that the crystalline form
> of your solid phase is not necessarily truly constant - what polymorph is
> involved? Melting points of alternative polymorphs can be radically
> different and this is one of the real bugbears of pharmaceutical and
> agrochemical development. If you haven't found the most stable form early
> in the development process there can be very nasty surprises downstream.
>
> Expecting to handle all these challenges with a descriptor as simple as a
> molecular fingerprint - regardless of bit-length, collisions etc. is
> probably over optimistic...
>
> Regards,
> Chris Earnshaw
>
> On Wed, 10 Oct 2018 at 13:16, Michal Krompiec <michal.kromp...@gmail.com>
> wrote:
>
>> Hi Thomas,
>> Radius 2, 2048 bits, 5200 data points.
>>
>> On Wed, 10 Oct 2018 at 13:13, Thomas Evangelidis <teva...@gmail.com>
>> wrote:
>>
>>> What's your bitvector length and radius? How many training samples do
>>> you have?
>>>
>>> On Wed, 10 Oct 2018 at 13:51, Michal Krompiec <michal.kromp...@gmail.com>
>>> wrote:
>>>
>>>> Hi all,
>>>> I have a slightly off-topic question. I'm trying to train a neural
>>>> network on a dataset of small molecules and their melting points. I did get
>>>> a not-so-bad accuracy with Morgan fingerprints, but I've realised that
>>>> regardless of FP radius and bitvector length, several dozen molecules have
>>>> the same fingerprints but wildly different melting points. I am pretty sure
>>>> this is a "solved problem" so I don't want to reinvent the wheel. What is
>>>> the recommended/usual way of dealing with this?
>>>> Thanks,
>>>> Michal
>>>>
>>>>
>>>> _______________________________________________
>>>> Rdkit-discuss mailing list
>>>> Rdkit-discuss@lists.sourceforge.net
>>>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>>>
>>>
>>>
>>> --
>>>
>>> ======================================================================
>>>
>>> Dr Thomas Evangelidis
>>>
>>> Research Scientist
>>>
>>> IOCB - Institute of Organic Chemistry and Biochemistry of the Czech
>>> Academy of Sciences <https://www.uochb.cz/web/structure/31.html?lang=en>
>>> Prague, Czech Republic
>>>   &
>>> CEITEC - Central European Institute of Technology
>>> <https://www.ceitec.eu/>
>>> Brno, Czech Republic
>>>
>>> email: teva...@gmail.com
>>>
>>> website: https://sites.google.com/site/thomasevangelidishomepage/
>>>
>>>
>>> _______________________________________________
>> Rdkit-discuss mailing list
>> Rdkit-discuss@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/rdkit-discuss
>>
>
_______________________________________________
Rdkit-discuss mailing list
Rdkit-discuss@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/rdkit-discuss

Reply via email to