Monday, 23 April 2018

Sistema de desconto reversível comercial


Forex enigma indicator download grátis.
Baixe o software gratuito Scalping Forex Enigma 2018.
10/4/2018 & # 0183; & # 32; Baixe o software gratuito Scalping Forex Enigma 2018. revela o incrível indicador secreto que poderia ter alguém lucrativo. Baixe Forex Enigma ...
Forex Enigma - FX - Onlineshop Premium Forex Shop.
www fxenigma com Indicador ForexEnigma ex4 V4 0-Documentos desbloqueados 1 ForexEnigma. Forex Sniper-EA. 4x Pip. Scalper EA. Forex Enigma $ 25.00. CÓDIGO: fxenigma.
Download do indicador de enigma forex - Markets Watcher.
Lankanews3.blogspot livre Guia Forex da DAS HOLDINGS você pode encontrar quando o mercado é forte e fraco. boa sorte. & # 215;
Forex Enigma | Free Download Internet Marketing - Negócios.
Forex Enigma download, Forex Enigma Link do produto: tradersoffer / product / forex - enigma / Indicador:. Download gratuito de Marketing na Internet - Cursos de negócios.
Forex Enigma Review - MELHOR MANEIRA PARA GANHAR US $ 1000 POR ...
Forex Enigma Review - MELHOR FORMA DE GANHAR US $ 1000. Forex Enigma indicator Free forex enigma download grátis. Forex Enigma EA vem com um 5 inteligente incorporado.
Forex Enigma - Comentários.
Se você perder mais do que você faz no Forex ... então eu tenho uma solução para você. Forex Enigma é um novo e exclusivo indicador de scalping projetado para os prazos M1 e M5.
Metatrader 4 Indicadores - Estratégias de negociação Forex grátis.
Explore nossa enorme coleção de indicadores Metatrader 4. Baixe instantaneamente indicadores comerciais gratuitos para a plataforma MT4, incluindo sinais, osciladores.
Forex Enigma Scalping Indicator Karll Dittmann grátis.
Forex Enigma Scalping Indicator by Karll Dittmann download gratuito. De volta ao 16/07/2018 postei uma perspectiva diária do par USDJPY. Naquele momento eu estava.
Indicadores MT4 | Livre MetaTrader 4 Indicadores Download.
Coleção de downloads de indicadores MT4 gratuitos - Tendência, Momento, Previsão, Volatilidade, Volume, indicadores estocásticos de divisas e muito mais! Obtenha seu download agora.
Forex Enigma Reivews | Top Trading System Review.
Home Forex Trading / Forex Trading Systems Forex Enigma Reivews. Forex Enigma Reivews. Forex. Indicador Forex Fxenigma. O sistema oferece alta livre.

Sistema de desconto de contas a receber esperado mais tarde este fiscal: Raghuram Rajan.
O governador da Reserva do Banco da Índia, Ragharam Rajan, disse hoje que o Sistema de desconto de recebíveis de negócios (TReDS) deverá lançar-se mais tarde, este fiscal, enquanto o Bharat Bill Payments System (BBPS) será lançado em breve.
O Sistema de Pagamentos de Bharat Bill (BBPS) foi implementado para atender às necessidades do grande mercado de pagamentos de contas, para proporcionar conveniência a qualquer momento, em qualquer lugar, qualquer facilidade de pagamento de contas para os usuários. É esperado que seja lançado em breve, & # 8221; Rajan disse enquanto se dirigia em uma função de premiação organizada pelo Institute for Development and Research in Banking Technologies.
Outra área importante de desenvolvimento foi o Sistema de desconto de contas a receber (TReDS), que deveria ser um cambiador de jogo para atender às necessidades de financiamento do segmento Micro, Pequenas e Médias Empresas (MSME).
& # 8220; as MPMEs são espremidas o tempo todo por seus grandes compradores, que pagam após longos atrasos. Tudo seria melhor se as MPME pudessem vender sua reivindicação no grande comprador no mercado. O MSME ganharia seu dinheiro rapidamente, enquanto o mercado obtesse uma reclamação no melhor comprador classificado melhor em vez de segurar uma reclamação nas MSME. Tudo isso acontecerá à medida que os três Sistemas de desconhecimento de contas a receber (TReDS) que o RBI licenciou, comece mais tarde neste exercício, & # 8221; ele disse.
O chefe da RBI também disse que a chave é reduzir os custos de transação automatizando quase todos os aspectos da transação, de modo que mesmo as mais pequenas mídias possam se beneficiar.
Falando sobre o desenvolvimento recente dos sistemas de pagamento no setor bancário, o governador disse que o RBI gostaria que os mecanismos de pagamento fossem baratos e escaláveis, de modo que eles se adequassem à economia da Índia, onde os tamanhos dos bilhetes são pequenos mas volumes de transações enormes # 8221 ;.
Um elemento desejável adicional seria para os diferentes sistemas de pagamento serem interoperáveis ​​- ou pelo menos poder conversar entre si, & # 8221; ele acrescentou ainda que o país precisa de sistemas que ofereçam segurança, mesmo para o usuário não sofisticado, e não colocam uma enorme carga de cuidados neles.
Receba as últimas notícias e atualizações no Auto Expo 2018, verifique as novidades no Orçamento 2018, como nós no Facebook e siga-nos no Twitter.
Sem comentários.
Direitos autorais e cópia; The Indian Express [P] Ltd. Todos os direitos reservados.

RBI emitiu diretrizes para a criação de sistema de desconto de contas a receber para MPMEs.
jagranjosh.
O Banco de Reserva da Índia, em 23 de julho de 2018, emitiu um projeto de diretrizes para a criação e operação de um Sistema de desconto de recebíveis comerciais, a fim de facilitar o financiamento a micro, pequenas e médias empresas (MPMEs) e para ajudá-los a converter créditos comerciais em fundos líquidos.
• O Sistema de desconto de contas a receber (TReDS) proporcionará a plataforma para reunir esses participantes para facilitar o upload, aceitação, desconto, negociação e liquidação das faturas ou contas das MPMEs.
• O TReDS também pode introduzir algumas auditorias aleatórias para garantir que não haja vestimenta e que as unidades de factoring carregadas na troca sejam autênticas e com base em transações genuínas.
• A TReDS colocaria em prática um mecanismo / processo padronizado para embarque de compradores e vendedores no TREDS. Este processo único no embarque exigirá que as entidades enviem todos os documentos relacionados ao KYC para o TReDS.
• O TReDS seria regido pelo quadro regulamentar implementado pelo Reserve Bank of India, no âmbito do Payment and settlement Systems Act 2007.
• TRÉDICOS não serão autorizados a assumir qualquer risco de crédito e o capital social com direito a voto mínimo deve ser de 100 rupias. Além disso, os TRÉDIOS devem ter um patrimônio líquido de 100 rupias em todo momento.
• A contribuição inicial mínima do promotor para o capital social com direito a voto remunerado de TREDS deve ser de pelo menos 40%, que deverá ser bloqueada por um período de cinco anos a partir da data de início do negócio dos TRÉDIOS.
• A participação de promotores em TReDS em excesso de 40% será reduzida para 40% no prazo de três anos a partir da data de início do negócio da TReDS.
• A participação estrangeira no TREDS seria conforme à política de IDE existente.
• O TReDS deve ter uma base tecnológica sólida no mínimo para suportar suas operações.
O Sistema de descontos de contas a receber de clientes (TReDS) é o esquema para a criação e operação de um mecanismo institucional para facilitar o financiamento de contas a receber comerciais de MPMEs de compradores corporativos através de múltiplos financiadores.
Se você tiver alguma pergunta / ponto sobre as informações acima, por favor, pergunte / discuta no Grupo de Assuntos Atuais.

Sistema de desconto reversível comercial
ESTADOS UNIDOS DA AMERICA.
COMMODITY FUTURES TRADING COMMISSION.
Na questão de:
Wayne I. Elliott.
Documento CFTC No. 95-1.
Francis Maritote, J. Brian Schaer, e: OPINIÃO E ORDEM.
Jonathan A. Sion,
A Divisão de Execução ("Divisão") apelou da demissão de um juiz de direito administrativo de uma denúncia de três conteúdos cobrando aos inquiridos violações da prática comercial em relação a 32 operações envolvendo futuros de trigo de março de 1991 no Chicago Board of Trade ("CBOT"). A queixa de dezembro de 1994 cobrou os inquiridos com violação da Seção 4c (a) (A) do Commodity Exchange Act ("CEA" ou da "Lei"), 7 U. S.C. § 6c (a) (A) (1994), contratando vendas de lavagem, violando a Seção 1.38 (a) dos Regulamentos da Comissão, 17 C. F.R. § 1.38 (a) (1997), executando transações não competitivas e violando a Seção 4c (a) (B) da Lei, 7 U. S.C. § 6c (a) (B) (1994), por causa da divulgação de preços não-válidos. Na sequência de uma audiência de prova, o ALJ constatou que as provas apresentadas pela Divisão eram insuficientes para sustentar essas acusações.
A Divisão desafia as descobertas factuais do ALJ e sua análise legal e alega que o ALJ cometeu um erro reversível ao descontar o valor probatório do conjunto de provas circunstanciais da Divisão e, em vez disso, creditou a afirmação dos entrevistados de que os negócios eram executados de forma competitiva. Com base em nossa revisão independente do registro, concordamos que a deferência para a avaliação do registro do ALJ é inadequada neste caso. Conforme explicado mais detalhadamente abaixo, invertimos a decisão inicial, ordenamos que cada um dos inquiridos cesse e desista de futuras violações da Lei e dos regulamentos da Comissão, impõe suspensões de seis meses a cada um dos inquiridos e avalie as penalidades monetárias civis no valor de US $ 50.000.
Comércio de trigo e o ciclo de entrega.
As transações em questão são operações de spread no contrato de futuros do trigo CBOT. (1)
Os meses de entrega para o contrato de futuros do trigo CBOT são março, maio, julho, setembro e dezembro. O mês de entrega de futuros que é mais próximo do prazo de maturidade é designado como o "mês próximo". O mês de entrega de futuros mais distante é designado como "mês de diferimento". Glossário às 13. Comprar o mês próximo e vender o mês diferido é conhecido como "comprar o spread", enquanto o reverso é chamado de "venda do spread".
No CBOT, os detentores de posições longas recebem entrega em uma "primeira entrada em primeiro lugar". Assim, o comprador com o contrato longo aberto mais antigo recebe o primeiro aviso de entrega e o comprador com o próximo contrato aberto mais antigo recebe o próximo aviso de entrega. CBOT Regra 1048.01. (2) Esta classificação de todos os detentores de posição longa resulta em uma linha de entrega organizada em ordem cronológica com o comerciante que detém a posição longa aberta mais antiga localizada na frente da linha e o comerciante que mantém a posição longa comprada mais recentemente na parte de trás de a linha.
O CBOT relata posições na linha de entrega no relatório "Entregas de Última Contagem Comercial Data". Outro relatório do CBOT, o "Editar e Parar o Listado", é publicado diariamente, começando no primeiro dia de notificação e terminando no último dia de calendário do mês de entrega. Para cada empresa de compensação, o relatório calcula o número total de bushels contidos nos avisos de entrega que a empresa de compensação emitiu no dia anterior. Ele também fornece o número total de bushels contidos nos avisos de entrega que a empresa de compensação recebeu, ou "parado", no dia anterior. Tr. em 94, 179-80, 536-39, 549-52, 553-54, 558, 573, 622-24, 641-43, 776-77, 778.
O sucesso de algumas estratégias de negociação depende da capacidade do comerciante de manter uma posição longa até o final do mês de entrega sem a entrega. Mover-se para a parte de trás da linha de entrega permite que um comerciante longo mantenha sua posição adiando a entrega. Para atingir o objetivo de se mudar para a parte de trás da linha de entrega, o comerciante deve liquidar sua posição longa original e depois estabelecer uma nova posição longa. Se o mercado se mover significativamente entre a liquidação da posição longa original e a abertura de uma nova posição, o comerciante pode sofrer uma perda na sua posição geral. Se o comerciante não se mover para a parte de trás da linha de entrega, no entanto, ele ou ela pode ser forçado a aceitar a entrega. (3)
Uma estratégia de negociação conhecida como "refrescar" é usada por detentores de posição longa que desejam manter suas posições até o final do mês de entrega. Como a entrega é atribuída aos compradores em uma primeira entrada, primeira saída, um comerciante que quer manter posições longas agirá para mover para trás da linha de entrega. O comerciante liquida sua posição atual, compensando-a contra sua posição mais antiga no mesmo contrato e estabelecendo uma posição longa equivalente no mesmo contrato. Uma vez que a nova posição foi estabelecida mais tarde, o comerciante difere a entrega da entrega. (4)
Muitos comerciantes optam por não manter posições abertas no mês de entrega ou são proibidas pelas empresas de compensação de serem negociadas durante o período de entrega por causa dos requisitos de crédito associados ao transporte da mercadoria em dinheiro. Assim, o número de comerciantes ativos em março de 1991 o trigo diminuiu à medida que a entrega se aproximava. Durante o período em questão, o número de comerciantes participantes variou de 41 em 26 de fevereiro a 7 de março em 13 de março. Divisão Ex. 14; Resp. Ex. D.
A negociação desafiada.
Durante o período de tempo relevante, os quatro inquiridos eram membros do CBOT e foram registrados na Comissão como intermediários. Cada um dos entrevistados era um comerciante ativo em spreads de grande volume, e todos ocuparam posições longas no contrato de março de 1991 e posições curtas nos contratos de maio e julho de 1991.
Os negócios contestados ocorreram entre 25 de fevereiro e 13 de março de 1991. Todos, exceto um dos negócios, excederam o grande nível de relatório de comerciantes da Comissão de 500 mil alqueireiros, e cada uma das negociações representou uma porcentagem significativa do volume de março de 1991 nas datas em questão . Rooney, declaração verificada em 54 ("Rooney"). Os cartões de negociação para cada um dos entrevistados para os seus spreads de trigo durante o período de entrega de fevereiro a março de 1991 revelaram 32 negociações em que os entrevistados, em várias combinações, compraram e vendiam quantidades idênticas do mesmo spread em diferenciais idênticos. Em cada comércio, três dos quatro entrevistados conseguiram se deslocar para a parte de trás da linha de entrega sem sofrer uma perda. Nenhum dos entrevistados percebeu um lucro ou incorrido em perda dessas negociações, e nenhum comerciante que não os entrevistados participou das negociações.
Evidência da Divisão.
A Divisão não desafiou a legitimidade do refresco como objetivo comercial, mas afirmou que os entrevistados usaram meios ilegais - vendas de lavagem pré-arranjadas - para atingir esse objetivo comercial. Hugh Rooney, investigador comercial de supervisão da Divisão, deu testemunho pericial na audiência. Ao chegar a sua conclusão de que a negociação dos entrevistados era mais provável que não competitiva do que competitiva, a Rooney analisou os cartões de negociação e as demonstrações de negociação diária para cada um dos inquiridos para cada um dos dias de negociação em questão. Ele também fez um exame abrangente dos componentes específicos de cada uma das negociações desafiadas, incluindo a data e o horário de execução, a quantidade negociada e o preço diferencial para cada comércio. (5) A Rooney observou que os entrevistados conseguiram obter liquidez significativa durante um período ilíquido. Ele também descobriu que, em cada uma das transações contestadas, os inquiridos participantes conseguiram obter um resultado de lavagem. (6)
Rooney também considerou significativo que, embora houvesse uma grande população de comerciantes de trigo ativos no poço durante parte do período relevante, nenhum outro comerciante participou das negociações desafiadas. (7) Rooney às 56-58. A este respeito, ele testificou que, embora os negócios provavelmente fossem feitos de forma audível e visível, o fato de que nenhum outro comerciante "pulou" sugeriu-lhe que o clamor não era genuíno. Tr. em 197-200.
Rooney também descreveu uma série de irregularidades na pista de auditoria e deficiências relacionadas aos negócios. Embora tenha admitido que erros ocasionais de registro ocorrem durante períodos de volatilidade de preços, ele declarou que as irregularidades específicas sugeriam comércio não competitivo, uma vez que não havia flutuação de preços. Ele testemunhou que muitas das iniciações e compensações comerciais foram registradas em linhas consecutivas ou no mesmo cartão comercial, do qual concluiu que os negócios foram executados em série ou simultaneamente. Rooney às 18, 63-64. Na sua opinião, trocas independentes e competitivas teriam sido executadas em momentos diferentes. Finalmente, Rooney deu sua opinião de que os entrevistados compartilhavam um motivo para evitar o risco de mercado ao melhorar suas posições, o que proporcionou um incentivo para que esses entrevistados entrassem em um acordo para operações não competitivas. Tr. em 192-93.
Na audiência, a Divisão estabeleceu que, em pelo menos um dos dias em questão, o entrevistado Schaer se comprometeu a se refrescar para o qual ele não foi acusado de negociação não competitiva. As características desse incidente de freshening diferiram marcadamente das características da negociação desafiada. Em 27 de fevereiro de 1991, o primeiro dia de posição para o contrato de trigo de março de 1991, o resfriamento de 1.010.000 bushels da Schaer envolveu dez negócios de rodada opostos a outros dez comerciantes em vários intervalos de tempo e a preços ligeiramente diferentes. Tr. às 54-58.
Em um contrato cobrado na mesma data, o entrevistado Elliott vendeu o entrevistado Sion 2,5 milhões de bushels do spread de março a maio em um único comércio. A Divisão estabeleceu que a Maritote precisava de menos de 1,5 milhão de bushels para ficar atrás da linha de entrega, mas, no entanto, comprou a Sion os 2,5 milhões de bushels que Sion precisava para ficar atrás da linha de entrega. Maritote vendeu então Elliott 2,5 milhões de bushels do spread de março a maio em duas transações. Tr. em 509-510. Todas as transações foram executadas em três colchetes comerciais, no mesmo diferencial sem lucro ou perda para nenhum dos inquiridos. Maritote afirmou que estava disposto a assumir essa grande posição porque sabia que poderia "afastá-la" sobre Elliott, que também estava oferecendo. Tr. em 514.
No cerne do caso dos entrevistados é sua afirmação de que os negócios contestados foram celebrados com um propósito de mercado legítimo. (8) Cada um testemunhou que sua negociação era independente, aberta e competitiva, e todos os entrevistados negaram a previsão de qualquer das negociações desafiadas. Eles sustentaram que cada um deles conseguiu estimar as necessidades comerciais dos outros, seja através de informações de entrega publicamente disponíveis ou observações inteligentes. Thomas Willis, um veterano comerciante da CBOT que testemunhou como especialista, analisou as declarações de investigação dos quatro entrevistados, bem como alguns dos cartões comerciais para os negócios contestados. Willis declarou que não via provas de negociação não competitiva. Tr. em 875.
Os entrevistados também ofereceram uma série de testemunhas de caráter que atestam a reputação dos inquiridos por honestidade e integridade. Stan Komparda lembrou-se de observar os comerciantes dos entrevistados para refrescar suas posições no momento relevante para a queixa. Ele testemunhou que ele se lembrou dessa negociação, porque era incomum que todos os quatro negociassem a opção vencida, e ele lembrou especificamente que eles estavam oferecendo e oferecendo por protesto aberto. Tr. em 741-45. Como outros membros da troca que testemunharam em nome dos entrevistados, Komparda não viu nada incomum sobre as configurações ilustradas pelos negócios diagramados de Rooney. Tanto Komparda quanto outra testemunha testemunharam que acreditavam que poderiam ter participado nesses negócios. Tr. em 745-47, 758.
A decisão inicial.
O ALJ descobriu que os entrevistados haviam testemunhado com sinceridade que todos os seus negócios eram executados de forma competitiva por protestos abertos e que qualquer comerciante presente no poço teve a oportunidade de participar das negociações desafiadas. Decisão inicial em 5. Ele concluiu que o padrão e a configuração das negociações desafiadas refletiam o conhecimento dos entrevistados de que uma ordem de compra ou venda por uma delas provavelmente seria seguida por uma transação reversa para ficar atrás da linha de entrega e ele creditou afirmação dos entrevistados de que as informações comerciais disponíveis para eles permitiram aos entrevistados antecipar as necessidades comerciais de cada um. (9)
O ALJ criticou o fracasso da Divisão em apresentar provas diretas de pré-disposição e declinou inferir o pré-arranjo das sete "características significativas" identificadas por Rooney: (10)
Tomados como um todo, essas características não comprovam que as transações contestadas foram o resultado de um acordo implícito ou expresso para convencer os negócios. As características comerciais citadas pela Divisão refletem um resultado natural de negócios que podem ocorrer em um mercado constrangido entre comerciantes de grande volume perspicazes e inteligentes que desejam garantir uma data de entrega posterior.
Decisão inicial em 11. (11) Ele concluiu que "os comerciantes que respeitam as regras do mercado não devem ser punidos por sua percepção e inteligência". Identidade. às 12, citando In re Angelo, [1990-1992 Transfer Binder] Comm. Fut. L. Rep. (CCH) 24.943 em 37.436 (Decisão inicial, 19 de outubro de 1990, sumariamente afeitado em 24 de março de 1993). Neste contexto, o ALJ enfatizou sua obrigação de "abster-se de desenhar linhas que terão o efeito de proibir o comportamento econômico legítimo". Identidade. às 8, citando In re Buckwalter, [1990-1992 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 24,995 em 37,684 (CFTC 25 de janeiro de 1991).
Argumentos das Partes sobre o recurso.
A Divisão exorta a apelar que as conclusões factuais do ALJ e a análise legal refletem um erro claro e não têm direito à deferência. Ele cita duas "questões centrais de fato" sobre as quais afirma que o ALJ errou. Primeiro, a Divisão sustenta que o registro não suporta a conclusão do ALJ de que cada entrevistado poderia confiar em informações de entrega publicamente disponíveis para estimar as posições abertas dos outros respondentes e o volume que os outros precisariam negociar para se deslocarem para trás linha de entrega. A Divisão enfatiza que o número de emissão e parada do CBOT não estava disponível em pelo menos alguns dos dias de negociação em questão e em outros dias relevantes continham apenas dados parciais. Da mesma forma, a Divisão argumenta que a conclusão do ALJ de um mercado restrito é parcialmente suportada pelo registro. A Divisão enfatiza que houve dias de negociação no início do ciclo de entrega para o trigo de março, no qual tanto o número de comerciantes quanto o volume de negócios eram substanciais. DOE, Ir. em 29.
No que diz respeito à análise jurídica do ALJ, a Divisão afirma que ele errou principalmente por: (1) não considerar o motivo para determinar se os entrevistados participaram conscientemente de negociação não competitiva; (2) concluindo que o fato de protesto aberto nega qualquer inferência de que as negociações desafiadas foram executadas de forma não competitiva; e (3) assumindo que uma determinação de responsabilidade no presente caso seria uma proibição de fato da prática de refrescar.
Os entrevistados contestam que o ALJ não cometeu nenhum erro garantindo a reversão. Eles sustentam que a Divisão não comprovou o pré-arranjo pelo peso da evidência e sugere que as "circunstâncias suspeitas" descritas pela Divisão não apoiem uma declaração de responsabilidade. Resp. Ir. às 14-15, citando Buckwalter, ¶ 24,995; Em re Rosenberg, [1990-1992 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 24,992 (CFTC 25 de janeiro de 1991); Em Gilchrist, [1990-1992 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 24,993 (CFTC 25 de janeiro de 1991).
Os entrevistados afirmam que as provas mostram, e o ALJ razoavelmente encontrado, que os entrevistados eram comerciantes de trigo de grande porte, cada um dos quais acreditava de forma independente que poderia ganhar dinheiro segurando uma propagação de touro de março-julho ou uma longa posição de março o mais próximo possível da expiração do contrato de março e três, por negociação competitiva, refrescaram suas posições longas de março para evitar a entrega. Ao fazê-lo, os entrevistados mantêm, era uma prática comercial legítima nos poços de grãos CBOT durante todo o ciclo de entrega. Na ausência de outros participantes do mercado interessados, os entrevistados argumentam que não tinham escolha senão negociar entre si e as características compartilhadas pelos negócios desafiados eram "a consequência inevitável da atividade legítima do mercado". Resp. Ir. às 26.
Ao pedir uma deferência à decisão do ALJ, os inquiridos enfatizam o precedente da Comissão que afirma que as determinações de credibilidade de um ALJ têm direito à deferência. Veja In re Citadel Trading Co., [1986-1987 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 23 082 em 32,190 (CFTC 12 de maio de 1986). Os entrevistados enfatizam que, quando um ALJ expressamente encontra uma testemunha mais credível do que outra, a Comissão tradicionalmente reconheceu esse "o mais alto grau de deferência". Buckwalter, ¶ 24,995 em 37,682. Além disso, os entrevistados afirmam que esta deferência deve se estender ao alcance das inferências extraídas do registro, desde que haja uma análise suficiente da confiabilidade dessas inferências. Rosenberg, 24,992 em 37,643. Os entrevistados sustentam que a recente decisão da Comissão de suspender a prática de adiar a avaliação de sanções de um ALJ não pretende alterar o grau de deferência habitualmente concedido a um ALJ em outros assuntos. Em Grossfeld, [pasta de transferência atual] Comm. Fut. L. Rep. (CCH) ¶ 26 921 a 44 467 (CFTC 10 de dezembro de 1996).
O comércio pré-estabelecido é uma forma de comportamento anticompetitivo que viola o Regulamento 1.38 da Comissão, 17 C. F.R. § 1.38 (1997). Em re Gimbel, [1987-1990 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 24,213 em 35,003 (CFTC 14 de abril de 1988). Uma transação pré-arranjada "nega tanto a concorrência de risco e preço incidente em um mercado de clamor aberto". Identidade . (12) Na visão da Divisão, as características comuns dos 32 negócios desafiados (tamanho, configuração, participação limitada e resultados de lavagem), combinadas com evidências de motivação comum e a presença de irregularidades na trilha de auditoria, constituem um conjunto de evidências circunstanciais de que obriga a conclusão de que os negócios foram executados de forma não competitiva.
O ALJ criticou o fracasso da Divisão em apresentar provas diretas de que os entrevistados pré-organizaram negociações para excluir outros participantes do mercado. Decisão inicial em 7. No entanto, como a Comissão detectou repetidamente, raramente existem provas diretas de negociação não competitiva e a análise circunstanciada dos padrões comerciais é um ponto de partida apropriado para comprovar o comércio não competitivo. Para prevalecer em um caso circunstanciado, a Divisão deve mostrar que os inquiridos têm maior probabilidade do que não participar conscientemente em negócios pré-estabelecidos. Rosenberg, 24,992 em 37,643. Uma vez que a Divisão estabeleceu um caso prima facie, a análise deve se concentrar na explicação dos entrevistados sobre os negócios contestados. Em re Bear, Stearns & amp; Co., [1990-1992 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 24,994 em 37,663 (CFTC 25 de janeiro de 1991).
O ALJ observou corretamente que a conduta ilegal não pode ser deduzida do fato de que os entrevistados entraram em negociações contestadas com a finalidade de se mudar para a parte de trás da linha de entrega. Decisão inicial em 7. A Comissão encontrou anteriormente "nenhuma incompatibilidade de princípio entre o motivo da negociação para ficar atrás da linha de entrega e a intenção de fazer uma transação de mercado de boa-fé". Em re Collins, [1986-1987 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 22.982 em 31.903 n. 25 (CFTC 4 de abril de 1986), rev. d por outros motivos, Stoller v. CFTC, 834 F.2d 262 (2d Cir. 1987). No entanto, as técnicas de negociação proibidas não são legais porque são feitas para um propósito de mercado legítimo. Collins, 22.982 em 31.900; cf. Em re Murphy, [1984-1986 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 22 798 (CFTC, 25 de setembro de 1985). A questão perante a Comissão é se o ALJ determinou corretamente no registro que a evidência circunstanciável da Divisão era insuficiente para sustentar a constatação de negociação não competitiva pelos inquiridos.
As características comuns dos ofícios desafiados.
Em oito dias de negociação entre 25 de fevereiro e 13 de março de 1991, três dos entrevistados conseguiram vender e recomprar grandes volumes em março-maio ​​ou março-julho, spreads de trigo em quantidades iguais ao mesmo diferencial de preços sem sofrer perda, lucrando , ou mudando sua posição líquida. As configurações dos negócios carregados refletem uma precisão e simetria geralmente não encontradas em negócios executados de forma competitiva. Como o especialista da Divisão observou, "... porque o preço deveria ser o único motivo para preferir um comerciante em relação a outro comerciante, o comércio competitivo geralmente está disperso entre vários comerciantes e corretores". Rooney em 59. As características dos negócios carregados são, além disso, marcadamente diferentes das de pelo menos um dos negócios de refresco do entrevistado que não envolveram os outros entrevistados. Veja a discussão supra, na p. 7.
Nas negociações cobradas, cada respondente participante precisava vender o spread para recomprá-lo e passar para a parte de trás da linha de entrega para o trigo de março. No entanto, em cada conjunto de negócios, pelo menos um respondente entrou no mercado comprando a propagação - às vezes em excesso do que ele precisava para ficar atrás da linha - e aumentando o tamanho de sua posição existente. (13) Como os entrevistados estavam tentando refrescar suas posições, a negociação de posições tão grandes - às vezes em excesso do que era necessário para se atrasar na linha de entrega - criava um risco excepcionalmente grande. Além disso, o comércio inicial em um movimento para refrescar a posição de alguém é inalterável: uma vez que o comércio é feito, não pode ser desfeito. Se um único respondente na transação semelhante a um anel não conseguisse desempenhar sua parte, nenhum dos inquiridos teria sucesso. Nessas circunstâncias, é improvável que um comerciante agisse repetidamente contra seu próprio interesse econômico sem o conhecimento prévio de que os outros participantes responderiam a troca do spread, permitindo que ele compensasse sua posição comprada recém-adquirida.
Um padrão de negociação inter-relacionada caracterizado pela conduta uniforme dos participantes é um elemento crítico da negociação não competitiva. A Comissão descobriu que uma configuração de negociação semelhante era consistente com a inferência de que os negócios estavam inter-relacionados, pré-arranjados e destinados a resultar em uma lavagem das posições individuais. Gimbel, ¶ 24,213 em 35,004; veja também Collins, ¶ 22.982 em 31.900 n.16; Gilchrist, 24,993 em 37,651. Os entrevistados explicam - e o ALJ concordou - que o tamanho, configuração e participação limitada comum aos negócios desafiados são atribuíveis ao fato de que nenhum outro comerciante estava interessado em participar das negociações. A este respeito, o ALJ descobriu que o mercado estava naturalmente restringido durante o período de entrega. Ele ainda encontrou evidências de que esses negócios foram executados por protestos abertos e concluíram que essa evidência apoiava a afirmação dos inquiridos de que outros comerciantes não estavam excluídos. (14)
Que os negócios parecem ter sido feitos com a formalidade do clamor aberto no poço não compelem a conclusão de que os negócios eram competitivos. A Comissão anteriormente advertiu isso: "porque um comerciante de lavagem é muitas vezes em uma excelente posição para criar uma falsa aparência de negociação de boa fé, evidências consistentes com a aparência da atividade de mercado legal não necessariamente negar uma inferência de outra forma válida de que os comerciantes estavam conhecendo participantes nas vendas de lavagem ". Bear Stearns, 24,994 em 37,663. Assim, a existência de protesto no poço é "apenas o início de uma análise significativa da validade de uma transação de acordo com a Regra 1.38 da Comissão". Buckwalter, ¶ 24,995 em 37,683. A característica essencial de uma venda de lavagem é a intenção de não fazer uma transação genuína e de boa-fé.
A Comissão em Bear Stearns, 24,994 em 37,663, observou que, em uma venda de lavagem, um comerciante dá a aparência de enviar negociações ao mercado aberto.
Sua intenção real no momento em que iniciou a transação, no entanto, é comprar e vender o contrato no mesmo preço ou similar - em outras palavras, para criar uma nulidade financeira e de posição estranha à descoberta de preços e mudança de risco funções dos mercados de futuros. A criação intencional de tal nulidade não pode ser considerada uma transação de futuros de boa-fé, mesmo quando a compra e venda independente do comerciante são executadas por protesto aberto e competitivo no pit comercial apropriado. De fato, "[para] obscurecer seu propósito ilegal, um comerciante de lavagem procurará alcançar seu objetivo de uma maneira que preserva a aparência de que a transação foi o resultado da atividade legal do mercado". Em Citadel Trading Co., ¶ 24.082 em 32.190.
Bear Stearns, 24,994 em 37,663.
Em Gilchrist, 24,993 em 37 653, a Comissão considerou que a intenção não era de realizar uma operação comercial de boa-fé em que as transações fossem "estruturadas de maneira a negar concorrência nos preços ou risco de mercado". Embora as transações em questão aqui parecem ter sido "gritadas", a ausência de qualquer lucro ou perda para qualquer dos inquiridos em qualquer uma das 32 transações suporta uma conclusão de que as negociações não foram executadas de forma competitiva.
A reivindicação dos entrevistados - creditada pelo ALJ - que eles eram o "único jogo na cidade" durante o período de entrega não é suportada pelo registro. Embora o mercado tenha se tornado mais constrangido à medida que o período de entrega progrediu, o número de comerciantes que compram e venda o spread de março alcançou os primeiros três dias referenciados na denúncia. Em 25 de fevereiro, havia 32 comerciantes negociando o spread de março, e em 26 e 27 de fevereiro participaram 41 comerciantes. (15) DOE Ex. 14 em 57. A decisão do ALJ não reconhece essa distinção nem aborda como um mercado aberto e competitivo poderia acomodar tão precisamente os objetivos financeiros e temporários de quatro comerciantes que atuam independentemente uns dos outros. Em Buckwalter, ¶ 24,995 em 37 684 (a negociação pré-estabelecida é "a melhor explicação para a habilidade estranha dos entrevistados de antecipar (e beneficiar) um nível de liquidez que geralmente não estava disponível para a maioria dos comerciantes no poço de trigo [CBOT]"). .
A análise do ALJ adotou a explicação dos entrevistados de que o padrão de negociação reflete o fato de que eles "sabiam que uma ordem de compra ou venda por uma delas provavelmente seria seguida por uma transação reversa para ficar atrás da linha de entrega". Decisão inicial em 11. No entanto, como observa a Divisão, o fato de que o comerciante "A" vende um spread de março a maio ao comerciante "B" não informaria o comerciante "A" que o comerciante "C" estaria disposto a vender a mesma quantidade do mesmo se espalhou para ele. Div. Ir. aos 35. O ALJ ainda argumenta que os inquiridos poderiam determinar sua posição na linha de entrega dos Relatórios Atribuídos pela Data de Comércio Última Entrega. A partir desse relatório, o Issue and Stop Listing e a sua própria observação, o ALJ concluiu que cada respondente poderia estimar as posições abertas dos outros entrevistados e antecipar o volume que os outros precisariam negociar para se refrescar. "Com essas circunstâncias como pano de fundo, não é surpreendente que os [esconderamentes] tenham comprado e vendido entre si uma quantidade igual do spread de março sem ganhar lucro ou sofrer uma perda". Decisão inicial em 11.
Embora a Lista de Problemas e Paradas tenha permitido que os entrevistados estimassem as posições de um outro, não teria revelado as quantidades exatas envolvidas. Além disso, a emissão e parada de listagem para o contrato de trigo de março de 1991 não foi divulgada até 27 de fevereiro e, portanto, não estava disponível para os inquiridos nos dois primeiros dias cobrados na reclamação. Resp. Ex. F. Além disso, as Listas de Emissão e Parada não refletiram notificações de entrega para os respectivos membros de compensação até 1 de março para Maritote e 4 de março para os outros três entrevistados. Resp. Ex. F. Assim, os inquiridos não teriam podido usar esta ferramenta para estimar os totais de posição nos primeiros quatro dias de negociação cobrados na reclamação.
Nós concordamos que pode haver circunstâncias em que comerciantes inteligentes e experientes possam, como resultado da familiaridade com o mercado e seus participantes, prever de forma independente as respostas de outros comerciantes para o estabelecimento de uma determinada posição. No entanto, as características dos negócios desafiados exigiram análise pelo ALJ quanto à questão de saber se o enfisitamento paralelo dos cargos dos entrevistados foi o resultado de uma atividade independente ou foi facilitado por um acordo expresso ou tácito para alcançar um resultado não competitivo. Que os entrevistados puderam estimar as necessidades comerciais de um outro com precisão precisa sugere fortemente que tais resultados não foram o resultado de uma atividade independente. A Divisão demonstrou que as informações de entrega em que os entrevistados afirmam ter confiado não foram públicas em vários dos dias de negociação cobrados. Além disso, os entrevistados conseguiram trocar grandes volumes - às vezes em excesso do que precisavam para ficar atrás da linha de entrega. Em cada uma das 32 sequências carregadas, os entrevistados "conseguiram [uniformizar os resultados". Div. Ir. às 30-33.
A participação em negócios que alcançam resultados estreitamente compatíveis e precisamente cronometrados "assume um significado adicional ... quando o registro estabelece ... uma motivação provável para que os participantes providenciem os resultados refletidos no padrão ..." Buckwalter, ¶ 24,995 em 37 684; see also Bear Stearns, ¶ 24,994 at 37,663. In addition to the configuration of the trades, the Division offered an "obvious motive commonly understood by all of the traders involved." DOE Appeal Br. at 21. Three of the respondents stood to profit from the narrowing spread if they were able to freshen their existing long positions without incurring substantial losses. By selling and then repurchasing March-May or March-July spreads to and from each other and Maritote, they were able to move to the back of the delivery line for March wheat, avoiding delivery and increasing their potential profits as the spread narrowed.
Moreover, because there were relatively few traders in the pit near the end of the delivery cycle, respondents' trading was necessarily interdependent: the conduct of one trader certainly would affect the behavior of the others. Once a freshening trader offsets his old position, he must find a trader willing to enter into an opposite transaction at a price equal to the price of the trade just made. In an increasingly illiquid market, that trader's success depends on other traders' willingness to provide quick liquidity to the market. Given the large positions held by each of the participating respondents and the volatility of the March spread market, respondents' interdependence was clear. Early in the delivery cycle, when the contract was more liquid, respondents may have been less dependent on each other and may have had less motivation to prearrange trades. However, the more likely conclusion is that these traders were interdependent throughout the entire delivery period. The record indicates that there was an abundance of "off-grade wheat" in the market during the February-March 1991 delivery cycle, and thus respondents had an unusually compelling motivation to insure that they would be able to freshen and avoid delivery. Tr. at 626.
The circumstantial evidence surrounding these trades supports the conclusion that the challenged trades were prearranged. The series of steps taken by respondents to execute the challenged trades exhibited parallel and mutually beneficial courses of action. Respondents' motive was clear, and the opportunity was present. Based on the totality of the circumstantial evidence in the record, it is unlikely that respondents could have known one another's precise needs without some meeting of the minds or could have achieved, this kind of trading symmetry in a volatile market by competitive execution. In these circumstances, we find that the ALJ erred in discounting the probative value of the Division's body of circumstantial evidence and relying instead on respondents' assertion that the trades were competitively executed.
Absent clear error, the Commission generally defers to the ALJ on matters relating to the credibility of witnesses. Gimbel ¶ 24,213 at 35,003; cf. Buckwalter, ¶ 24,995 at 37,686. Where as here, the ALJ's credibility findings are strongly contradicted by other compelling evidence, and the issue to be decided involves derivative inferences rather than testimonial inferences, the Commission will review de novo the inferences to be drawn from the record. See N. L.R. B. v. Stor-Rite Metal Products, Inc., 856 F.2d 957, 964 (7th Cir. 1988); Kacem v. Castle Commodities Corp., [Current Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 27,058 at 45,030 (CFTC May 20, 1997). Based on our independent assessment of this record, we find that deference to the ALJ's inferences would be misplaced and reverse the Initial Decision.
The Division has established that respondents failed to execute the 32 charged trades openly and competitively as required by Commission Regulation 1.38(a). The interrelationship of these trades, illustrated by precise and symmetrical configurations, the exclusivity of trading among the four respondents, and the absence of profit or loss incurred, compels the conclusion that the trades were prearranged by respondents. Respondents claimed that the configuration of these trades was the product of experience, insight, and publicly available trading data and, moreover, that they traded only among themselves as a matter of necessity. While experience and trade data might be useful in predicting the approximate size that another trader might intend to trade, these tools do not explain the accuracy with which respondents were able to buy and sell precisely offsetting quantities in all 32 of the charged trades. Respondents' contention that they were the "only game in town" is not borne out by the record. Although there were relatively few traders in the wheat pit on the last two of the charged trading days, respondents' noncompetitive trading commenced on February 25, when there were as many as 41 traders in the pit, and continued throughout the eight charged days.
Freshening, when accomplished through open and competitive trading, serves a legitimate market purpose. However, a lawful economic purpose does not legitimize prohibited trading techniques. Collins, ¶ 22,982 at 31,900. A prearranged transaction that is accomplished in the pit is a fictitious sale within the meaning of Section 4c(a)(A) of the Act, and "when a prearranged transaction in the pit is structured to produce a wash result, it is both a fictitious sale and a wash sale under Section 4c(a)(A)." Gimbel , ¶ 24,213 at 35,003. Because the trades were not executed openly and competitively, the prices reported from those trades were not bona fide . Accordingly, respondents are also liable for violations of Section 4c(a)(B). Gilchrist, ¶ 24,993 at 37,653.
Sanctions in Commission enforcement proceedings are imposed to "further the Act's remedial policies and to deter others in the industry from committing similar violations." In re Volume Investors Corp., [1990-1992 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 25,234 at 38,679 (CFTC Feb. 10, 1992). In determining the appropriate level of sanctions, the Commission focuses on the relative gravity of a respondent's misconduct in light of the following factors: (1) the relationship of the misconduct to the regulatory purposes of the Act; (2) the respondent's state of mind; (3) the consequences flowing from the violative conduct; and (4) the respondent's post-violation conduct. In addition, the Commission considers any mitigating or aggravating circumstances presented by the facts. Identidade. In connection with the imposition of civil monetary penalties, the Commission also has considered whether the misconduct was intentional or willful, the number and duration of the violations, and whether the respondent acted in concert with others. See A Study of CFTC and Futures Self-Regulatory Organization Penalties, 2 Comm. Fut. L. Rep. (CCH) ¶ 26,264 at 42,219 (CFTC Nov. 1994); CFTC Policy Statement Relating to the Commission's Authority to Impose Civil Money Penalties and Futures Self-Regulatory Organizations' Authority to Impose Sanctions: Penalty Guidelines, 2 Comm. Fut. L. Rep. (CCH) ¶ 26,265 at 42,247 (CFTC Nov. 1994). Based on our assessment of these factors, we impose the following sanctions.
1. Cease and Desist Order.
A cease and desist order is particularly appropriate where, as here, there is a reasonable probability that a respondent will again engage in unlawful conduct. Respondents, when holding a large quantity of wheat for which they might have to take delivery, repeatedly engaged in noncompetitive trading to freshen their positions while limiting their risk. They have urged consistently that the trading characteristics identified by the Division are endemic to freshening and have suggested that the benefits of freshening to the marketplace as a whole somehow justify the noncompetitive execution. In these circumstances, it is reasonable to assume that respondents will continue to engage in unlawful prearranged trading in order to accomplish freshening. Accordingly, respondents are ordered to cease and desist from further violations of Commission Regulation 1.38(a) and Sections 4c(a)(A) and 4c(a)(B) of the Act.
2. Trading Prohibition.
By definition, respondents' violations of Sections 4c(a)(A) and 4c(a)(B) impacted the integrity of the market by significantly inflating the volume in the wheat spread on the charged days. Although the trading did not involve customer accounts, a clear nexus to marketplace integrity has been established, and we impose on each respondent a six-month trading ban. See Citadel Trading, ¶ 23,082 at 32,191; cf. In re Murphy, ¶ 22,798 at 31,355.
3. Civil Monetary Penalties.
Sanctions imposed in an enforcement proceeding should be commensurate with the gravity of the violation. In re Brody, [1986-1987 Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 23,081 at 32,181 (CFTC May 20, 1986). Violations such as these involve noncompetitive trading and harm to market integrity and thus implicate core provisions of the Act and Commission Regulations. They are not mitigated by the fact that the challenged trades caused no specific, quantifiable injury to particular customers or other traders. Buckwalter, ¶ 24,995 at 37,688. In these circumstances, a civil monetary penalty of $50,000 for each respondent is commensurate with the gravity of their conduct and consistent with sanctions imposed in recent trade practice cases.
In light of the foregoing, the ALJ's Initial Decision is reversed, and respondents are hereby ordered to cease and desist from further violations of the Act and Commission Regulations. The trading prohibitions and civil penalties shall become effective 30 days from the date this order is served. (16)
IT IS SO ORDERED.
By the Commission (Chairperson BORN and Commissioners HOLUM and SPEARS).
Commissioner TULL, dissenting.
Secretary of the Commission.
Commodity Futures Trading Commission.
Dated: February 3, 1998.
In the Matter of Wayne I. Elliott, et al. CFTC Docket No. 95-1.
Commissioner John E. Tull, dissenting.
This is a case based on circumstantial evidence. I recognize that in many of our trade practice cases, circumstantial evidence is the primary­sometimes even the only­basis for proving the alleged violations. In those instances, we must decide the case by drawing inferences from the evidence in the record. Such is true in this case. As we recently stated:
To succeed in a circumstantial approach, however, the Division [of.
Enforcement] must do more than present suspicious circumstances suggesting the possibility of knowing wrongdoing. It must establish that "the existence of these factual elements is more probable than their nonexistence."
In the Matter Rousso, et al. , [Current Transfer Binder] Comm. Fut. L. Rep. (CCH) ¶ 27,133 at 45,308 (CFTC Aug. 20, 1997) (citations omitted)
I am not persuaded by the Division's evidence that the alleged violations probably occurred in this instance. Therefore, I differ with my fellow Commissioners here on the interpretation of what inferences should be drawn from the circumstantial evidence presented by the Division of Enforcement. Accordingly, I would vote to affirm the ALJ's finding that no trading violations have been proven by the evidence in the record.
This agency has long acknowledged that "experienced professionals cognizant of the customs of the pit, the members of the Floor Governors Committee are in the best position to draw inferences from evidence presented to them." In the Matter of Malato , [1987-1990 Transfer Binder] Comm. Fut. L. Rep (CCH) ¶ 24,084 at 34,708 (CFTC Dec. 22, 1987). As a former trader myself, I believe that I am uniquely qualified among my fellow Commissioners to interpret the implications and inferences of the trading activity that lie at the heart of this case.
There is no dispute that it is permissible to "freshen" a position, as the traders did here. The majority argues that these individuals merely used illegal methods to achieve the freshening. I do not believe the evidence supports their view. The traders gave direct testimony that they did not engage in improper methods to freshen their positions.
In layman's terms, the majority engages in "second guessing" the activities and intentions of the traders in this case. In so doing, they ignore the findings of the ALJ based on.
his first-hand observation of the demeanor of the traders and other significant evidence supporting the credibility of the traders:
1) the ALJ found the four individuals charged to have "testified truthfully;" ( In the.
Matter of Wayne I. Elliott, et al. , CFTC docket No.95-1, ALJ's Initial Decision,
September 11, 1996, ¶ 28 at 5),
In the Matter of Wayne I. Elliott, Commissioner Tull, dissenting.
2) their Pit Chairman testified that they are all honest traders; ( Id . ¶ 27 at 5 and Tr.
3) this agency's Enforcement Division even stipulated that the Respondents "have high integrity and honest reputations as traders. They do. No question." (Id. ¶ 27 at 5 and Tr. 729-730).
Admittedly, it is also possible to freshen by noncompetitive trading. As an experienced former professional trader, I simply do not agree with the majority's conclusions in this case .
On the facts presented, the Division of Enforcement has failed to convince me that such unlawful trading has been proven in this case. I believe that this freshening occurred by lawful means.
Since I concur with the ALJ in finding that the Division of Enforcement failed to meet its burden, I must respectfully dissent from the majority's opinion in this case.
BY: ________________________________ , February 2, 1998.
1 Although "spread trade" can have a different meaning in other contexts, in the context of this case a spread trade is the simultaneous purchase of one futures delivery month against the sale of the same quantity in another futures delivery month. Glossário CFTC; A Layman's Guide to the Language of the Futures Industry , 38 (CFTC: 1997) (" Glossary ").
2 A delivery notice is a written notice from a seller's clearing firm notifying the clearing house that its customer intends to deliver wheat. See CBOT Rule 1047.01.
3 Should this occur, the trader incurs carrying charges, including the cost of storing and insuring the wheat and the interest payments for the funds used to purchase the wheat. A trader who must accept delivery of 1,500,000 bushels will incur carrying charges of approximately $3,750 per day. (Tr. at 48, 176-77, 437-38, 488, 490, 645, 647, 709, 717, 782, 820; Testimony of Hugh Rooney at 14).
4 This step is accomplished through an instruction to the trader's futures commission merchant ("FCM"). Without such an instruction, the trades would be considered "day trades" and would not accomplish the freshening objective.
5 Using this data, Rooney diagrammed each of the challenged trades. Each diagram illustrated a "ring-like" pattern in which Trader A traded with Trader B, who then traded with Trader C, who completed the circle by trading with Trader A. On six days the trades were structured as three-sided configurations in which each of three traders sold and bought back the same quantity of March spreads at the same differential price, with no resulting profit or loss. On the remaining day, the challenged trades were structured as a four-sided configuration involving all four respondents. Rooney observed that, because competitive trading is characterized by randomness, such symmetrical and consistent trade configurations suggest an "implied or explicit agreement between the traders." Rooney at 58-61.
6 Respondents have conceded that they executed the challenged trades in the configurations described by Rooney, that the trades were done to freshen their positions, and that all the trades were accomplished without profit or loss. See, e. g., Responses to Division Requests for Admissions filed by Elliott, Schaer, Sion and Maritote; Elliott Post-Hearing Brief at 11; Sion Post-Hearing Brief at 8.
7 At the hearing, the Division established that on most of the trading days at issue a significant number of traders other than respondents traded March spreads in the pit. Tr. at 56, 335, 351, 491, 497, 500, 653-55, 657, 814, 838-39.
8 Respondents offered the expert opinion of Dr. Scott Irwin, a professor of agricultural economics at Ohio State University, who described the illiquidity of agricultural futures contracts as they approach expiration and gave his opinion that traders who participate during the delivery cycle add a beneficial competitive force by creating liquidity during this period. Irwin Opinion at 2, 6; Tr. at 689-96.
9 The ALJ found that, by looking at the "Deliveries Last Trade Date Assigned" Reports, respondents could determine their position in the delivery line. He also found that from the "Issue and Stop Listings" respondents could estimate the open positions of the other respondents and anticipate the volume the others would need to trade in order to move to the end of the delivery line. Initial Decision at 11.
10 As summarized by the ALJ , these characteristics are: (1) the execution of trades for the purpose of moving behind the delivery line; (2) audit trail irregularities associated with the trades; (3) the consecutive recordation of some of the trades; (4) the large volume of the trades; (5) the absence of any other participants in the trades; (6) respondents' purchase and sale of an equal quantity of the March spread on each trading day at issue; and (7) the execution of the buy and sell sides of each trade at an identical differential. Initial Decision at 7-8; Rooney at 16, 54-65.
11 The ALJ similarly rejected the Division's audit trail evidence, observing that in light of respondents' reputations for probity, "the unavoidable errors that result when trying to document trades under a crude and archaic system" cannot be ascribed to a scheme to prearrange trades. Initial Decision at 9.
12 Regulation 1.38 requires that all transactions in a futures market be executed "openly and competitively by open outcry." The execution of noncompetitive trades is the premise for the Division's contentions that: (1) the challenged trades were wash sales in violation of Section 4c(a)(A) of the Act; and (2) respondents caused the reporting of non - bona fide prices in violation of Section 4c(a)(B).
13 In the February 27 trading sequence discussed supra, Maritote explained that he bought 1 million bushels more than he needed to get behind the delivery line because he knew he could "lay it off" on Elliott, who was also trading. Tr. at 514. Maritote's testimony does not fully explain, however, why he would have purchased a million bushels he did not need without advance knowledge that Sion, in order to be accommodated, needed 2.5 million bushels. Had Maritote not been willing to buy and sell the entire 2.5 million bushels at an 11 1/2 cent differential, Sion and Elliott would not have achieved their objective of moving back in the March delivery line.
14 In this regard, respondents maintain (and several witnesses testified) that, because their trades were executed by open outcry, any trader in the pit could have participated. The ALJ found that the evidence was "overwhelming" that the challenged trades were made by open outcry. Initial Decision at 6. The Division concedes that the trades might have been "cried out," but argues that the outcry was a formality rather than genuine or meaningful, a distinction previously recognized by the Commission. Bear Stearns, ¶ 24,994 at 37,664; Buckwalter, ¶ 24,995 at 37,683; Murphy, ¶ 22,798 at 31,351.
15 By March 4, the March spread market was substantially more constricted, and on March 8 respondents were the only traders who traded over 1,000,000 bushels. Rooney conceded that the Division would not have brought a case if the trades of March 8 and 13 were the only two days on which respondents had traded among themselves. Tr. at 319-20. The ALJ observed that the Division is thus "only offering refuge to traders who freshen . . . on no more than two trading days when a maximum of three other traders are present." On the contrary, we believe Rooney's statement reflects the Division's recognition that there may in fact be periods during the delivery cycle when limited participation may necessitate trading among only a handful of traders and that those circumstances alone are not suggestive of prearrangement. In this case, however, respondents' trades exhibited the same characteristics even when they were not the logical result of constricted market conditions.
16 A motion to stay the effect of this decision pending reconsideration by the Commission or review by a court must be filed within 15 days of the date this order is served.

Does temporal discounting explain unhealthy behavior? A systematic review and reinforcement learning perspective.
The tendency to make unhealthy choices is hypothesized to be related to an individual's temporal discount rate, the theoretical rate at which they devalue delayed rewards. Furthermore, a particular form of temporal discounting, hyperbolic discounting, has been proposed to explain why unhealthy behavior can occur despite healthy intentions. We examine these two hypotheses in turn. We first systematically review studies which investigate whether discount rates can predict unhealthy behavior. These studies reveal that high discount rates for money (and in some instances food or drug rewards) are associated with several unhealthy behaviors and markers of health status, establishing discounting as a promising predictive measure. We secondly examine whether intention-incongruent unhealthy actions are consistent with hyperbolic discounting. We conclude that intention-incongruent actions are often triggered by environmental cues or changes in motivational state, whose effects are not parameterized by hyperbolic discounting. We propose a framework for understanding these state-based effects in terms of the interplay of two distinct reinforcement learning mechanisms: a “model-based” (or goal-directed) system and a “model-free” (or habitual) system. Under this framework, while discounting of delayed health may contribute to the initiation of unhealthy behavior, with repetition, many unhealthy behaviors become habitual; if health goals then change, habitual behavior can still arise in response to environmental cues. We propose that the burgeoning development of computational models of these processes will permit further identification of health decision-making phenotypes.
Introdução.
Unhealthy behaviors, such as tobacco smoking, excessive alcohol intake, physical inactivity and substance misuse, account for significant morbidity and mortality worldwide (Smith et al., 2018). For example tobacco smoking was estimated to be responsible for a fifth of all deaths in the US in 2005 (Danaei et al., 2009). In the developed world these behavioral risk factors can be viewed as a direct expression of personal choice, and research has therefore been directed toward establishing their psychological determinants (Conner and Norman, 2005). Unhealthy behavior frequently has a delayed effect on health, leading researchers to hypothesize that an individual's tendency to make unhealthy choices is related to their temporal discount rate, the theoretical rate at which they devalue delayed outcomes (Grossman, 1973; Bickel et al., 2018). In recent years a large number of studies have addressed this hypothesis, forming part of a growing endeavor to identify decision-making phenotypes which correlate with maladaptive behavior (Montague et al., 2018).
Given a choice, people tend to prefer immediate rewards to those available after a delay, for example preferring to receive $10 today over $15 in a month, implying that they discount the value of delayed rewards relative to more immediate ones. A function which describes the pattern of discounting can be estimated by observing choices between delayed outcomes. Economic theories of rational behavior posit that goods ought to be discounted exponentially with delay (Samuelson, 1937). Formally, an outcome which has utility A if received immediately ( t = 0) is worth A · δ t if delayed t periods into the future. The present-time value, V , of receiving A at time t is thus given by:
Here the discount rate, δ, represents the constant proportional decrease in value with each added time period of delay. However both humans and animals violate the exponential assumption of a constant proportional discount factor per unit time, appearing rather to discount rewards occurring in the immediate future more steeply than those in the distant future. In other words, delaying an immediate reward 1 day into the future decreases its implied value proportionally more than does delaying an already distant reward by 1 day (Strotz, 1957; Chung and Herrnstein, 1967; Ainslie, 1974, 1975; Mazur, 1987; Benzion et al., 1989; Green et al., 1994). The discount function estimated from observed choices is better accounted for by a hyperbolic function, written in its simplest form as follows, where k represents the hyperbolic discount rate:
A proposed alternative is the “quasi-hyperbolic” approximation to hyperbolic discounting (Phelps and Pollak, 1968; Laibson, 1997; Angeletos et al., 2001; McClure et al., 2004, 2007), which is formalized as exponential discounting, with an additional preference for immediate rewards, expressed as a second discount factor, β, applied to all time-periods except the first:
The hyperbolic and quasi-hyperbolic forms qualitatively capture the observation that individuals make far-sighted plans when outcomes are distant, but reverse their choices in favor of short-term rewards when the future is reached (see Kalenscher and Pennartz, 2008 for a review). Hyperbolic or quasi-hyperbolic discounting therefore putatively imparts a psychological explanation for why people intend to perform actions they subsequently fail to carry out. In economics this is referred to as myopic preference reversal (Kirby and Herrnstein, 1995), and bears considerable relevance to health choice, where there is a marked discrepancy between health intentions and health behavior (Conner and Norman, 2005).
This article examines the relationship between temporal discounting and health behavior at two levels of analysis: empirical and mechanistic (Rescigno et al., 1987). A distinction is often made in psychology and neuroscience between empirical and mechanistic classes of model. Empirical, high-level, or descriptive models seek to capture structure in observed data, allowing predictions to be made, but lack psychological content (Lewandowsky and Farrell, 2018). Thus temporal discounting might predict markers of health behavior, without being upheld as a psychological process underlying unhealthy choice (see Frederick et al., 2002 for an historical perspective on discounting as a descriptive model). For example, discounting measures might simply be correlated with environmental factors influencing health behavior, such as peer influence. By contrast, mechanistic, low-level or explanatory models specify psychological processes at an abstract level, or seek to identify how neural systems solve computational problems (Farrell and Lewandowsky, 2018; Lewandowsky and Farrell, 2018; Montague et al., 2018). Some authors propose that hyperbolic discounting is a mechanistic model, i. e., a fundamental principle for choosing between delayed outcomes (for example Ainslie, 2001). This notion leads to the hypothesis that unhealthy choices directly result from hyperbolic discounting of their distant health consequences. The latter is consistent with the recent proposal that discounting is a trans-disease process underlying several impulsive pathologies (Bickel et al., 2018). To address the first, empirical, hypothesis we perform systematic review of studies which examine relationships between temporal discount rates and health behavior or health status. To address the plausibility of the second, mechanistic, hypothesis we examine whether both health behavior in the field and laboratory choice behavior fulfill the predictions of hyperbolic discounting. Predicting unhealthy choice has potential utility in guiding health behavior change interventions (Conner and Norman, 2005), while understanding the processes involved in unhealthy choice may assist in designing novel interventions.
Can measured discount rates predict health behavior?
Differences between discounting for health and money.
Measured discount rates are not equal for all commodities, for example, people discount primary reinforcers such as food and water more steeply than money (Odum, 2018). This raises an important question: which outcome modality is the best candidate predictor of health behavior? With regard to real-world health choices the relevant delayed outcome is a health state. Indeed numerous studies have attempted to measure discount rates for hypothetical health states (Asenso-Boadi et al., 2008). Many such studies require participants to imagine a hypothetical illness, usually described in generic terms, and to trade-off the severity or duration of the illness against when they would prefer the illness to occur (for example Van Der Pol and Cairns, 2001). Individuals who are willing to accept a more severe illness occurring after a delay over a less severe immediate illness are said to discount future illness. Alternatively, the health state may be described as an improvement in health from a baseline of illness (for example Ganiats et al., 2000), where individuals who prefer immediate over delayed health improvement, and are willing to accept a small improvement in health occurring sooner, over a larger improvement at a delay are said to discount future health.
Studies of hypothetical health discounting, for both health improvements and illness, demonstrate that some important properties of monetary discounting are conserved in the health domain. Several studies find that many people indeed discount future health improvements in the conventional sense, also referred to as positive time preference for health (Lipscomb, 1989; Olsen, 1993; Bleichrodt and Johannesson, 2001). Also consistent with monetary discounting there is robust evidence for decreasing health discount rates over increasing delay (Bleichrodt and Johannesson, 2001). Furthermore, Van Der Pol and Cairns (2002) asked participants to make choices relating to a generic illness, which could be delayed into the future by means of an imaginary treatment, showing that hyperbolic discount functions fitted choices better than exponential ones (see also Van Der Pol and Cairns, 2018).
Problematically however temporal discounting for health, unlike monetary, outcomes is far from universal. A proportion of people in fact prefer to advance the timing of illness (Cairns, 1992; Redelmeier and Heller, 1993; Chapman, 1996; Chapman and Coups, 1999; Van Der Pol and Cairns, 2000, 2002, 2018; Chapman et al., 2001) or to delay health improvements (Olsen, 1993; Chapman and Elstein, 1995; Dolan and Gudex, 1995; Chapman, 1996). This is the opposite pattern to conventional (positive) discounting, and is referred to as negative time preference or negative discounting. Across the majority of studies the proportion of people displaying negative time preference ranges from 3 to 10%. In addition, a proportion of people do not discount hypothetical future health outcomes at all, preferring to experience better health, irrespective of delay, which is termed zero time preference (Cairns, 1992; Olsen, 1993; Redelmeier and Heller, 1993; Chapman and Elstein, 1995; Dolan and Gudex, 1995; Chapman, 1996; Chapman and Coups, 1999; Van Der Pol and Cairns, 2000; Chapman et al., 2001; Van Der Pol and Cairns, 2002, 2018).
Notably, negative and zero time preference are also observed for aversive outcomes. In choices between genuine delayed painful events, many people prefer to expedite inevitable pain (negative time preference) (Loewenstein, 1987; Berns et al., 2006; Story et al., 2018), or simply to receive less pain, irrespective of its timing (zero time preference) (Story et al., 2018). Negative time preference can be attributed to the fact that anticipating delayed pain is itself aversive, termed dread (Loewenstein, 1987), a quantity which can be minimized by choosing to “get the pain out of the way” (Loewenstein, 1987; Loewenstein and Prelec, 1991; Berns et al., 2006; Story et al., 2018). A related observation is that monetary losses tend to be positively discounted at a lower rate than monetary gains, referred to as the sign effect (e. g., Thaler, 1981; Benzion et al., 1989), which could be explained by a degree of dread for monetary losses (Loewenstein, 1987) (although could also result from differences in the shape of the instantaneous utility function for losses and gains; Loewenstein and Prelec, 1991). There is also evidence for a sign effect in the aversive domain, whereby describing delayed painful outcomes in terms of relief from a more severe pain reduced the overall degree of negative time preference (Story et al., 2018). Chapman (1996) found a sign-effect for health outcomes, whereby illness was discounted less than health improvement, to the extent that discount rates for illness were uncorrelated with discount rates for health improvement. MacKeigan et al. (1993) report a similar result for preferences over hypothetical improvements or decrements in health framed as a scenario of arthritis, finding positive discounting for health improvement and for long periods of illness, but negative discounting for fleeting illnesses.
Thus discounting behavior for hypothetical illness and health improvement closely reflects that for pain and relief. Indeed illness and health improvement are intuitively more analogous to pain and relief than to appetitive rewards and losses. Furthermore, consistent with respondents exhibiting dread for delayed illness, in an illness discounting task, individuals who perceived the illness as more severe were more likely to have negative discount rates (Van Der Pol and Cairns, 2000). Zero time preference for health and pain-related outcomes may result from the experiential nature of these outcomes; unlike money, health is an experiential commodity which cannot be saved or invested (Chapman and Elstein, 1995).
Commensurate with the prevalence of negative and zero time preference for health, discount rates for health and money are poorly correlated across individuals (Cairns, 1992; Chapman and Elstein, 1995; Chapman, 1996, 2002; Lazaro et al., 2001; Petry, 2003), and remain so even when health and monetary outcomes are utility matched (Chapman, 1996). For example, Chapman and Elstein (1995), who also included a third domain of vacations of varying duration, report an overall mean Spearman coefficient of 0.25 for the correlation between discount rates of different domains. Chapman (1996) further demonstrated that low correlations between health and monetary domains persist even when the outcomes are matched for sign as well as utility. Correlations between the two domains overall showed a trend toward being higher when matched for sign, although all correlations remained weak (Spearman r 0.06‘0.28). In other words discount rates for illness showed marginally stronger correlations with monetary losses than with monetary gains, and discount rates for health improvement showed marginally stronger correlations with monetary gains than with monetary losses, though all such correlations are weak and most do not reach conventional levels of significance.
The low correlations between health and monetary discounting as well as the prevalence of negative and zero time preference for health and aversive outcomes demonstrate that discounting cannot be considered a universal mechanism by which all delayed events are evaluated, and appear to question any mechanistic hypothesis whereby discounting of delayed health causes unhealthy choice. It remains possible however that people represent health outcomes differently when making real health choices as opposed to responding in hypothetical health discounting tasks. In addition, discounting for either health or money may nevertheless be capable of predicting real-world health behavior, through a correlation with processes underlying health choice.
Systematic review methods.
To address the question of whether measured discount rates are capable of empirically predicting health behavior we performed systematic literature review. In order to identify studies comparing discount rates with health behavior or health status, we searched the PubMed database for full-text articles published up to January 2018, using keywords relevant to discounting together with the words “Health,” “Illness,” “BMI,” “Obesity,” “Alcohol,” “Drinking,” “Smoking,” “Drug” or “Behavior.” This search strategy yielded 104 studies. The abstracts (and where needed full text) of these were then reviewed for suitability: studies were included if they compared the results of any delay discounting paradigm to an observed health-related measure or behavior. In total 34 suitable full text articles were identified from this search. The references and citation lists of these studies were also reviewed for inclusion, yielding a further 78 suitable studies, yielding a total of 112 suitable studies (summarized in Supplementary Tables 1‘5). Several of these studies have been reviewed elsewhere, chiefly in the context of addiction (Reynolds, 2006b; MacKillop et al., 2018; Bickel et al., 2018, 2018; Koffarnus et al., 2018). Here a broader range of health outcomes are reviewed. The studies are discussed below, organized by first by the modality of discounting (hypothetical health versus money or other appetitive rewards) and subsequently by the nature of the health outcomes (tobacco smoking, alcohol use, illicit substance misuse, obesity and eating behavior, preventive health behavior, risky sexual behavior and drug-taking practices or multiple health behaviors).
Can discount rates for hypothetical health outcomes predict health behavior?
The finding that a significant proportion of individuals exhibit negative and zero discount rates for health casts doubt on the suggestion that health discount rates can predict unhealthy behavior. Indeed the few studies to have compared health discount rates with observed health behavior find weak or absent correlations. Four studies in our search sample examined relationships between health discounting and unhealthy behavior. Two of these tested relationships between health discounting and cigarette smoking status. Baker et al. (2003) compared discount rates for health in current and never-before cigarette smokers, finding that current smokers discounted both hypothetical improvements and decrements in health at a marginally higher rate than the never-before smokers, but this difference did not reach statistical significance. For monetary gains both hypothetical and real, current smokers delay discounted at a significantly higher rate than the never before smokers. Khwaja et al. (2007) found no significant differences between smokers and non-smokers in health discounting for health outcomes (however neither did this study find a relationship between smoking status and monetary discounting). A single study (Petry, 2003) examined discount rates for health outcomes in substance misusers, finding significantly higher discount rates for hypothetical health, money and freedom (from time spent in jail) in a group of current or previous heroin and/or cocaine users compared with a group of controls with no history of substance misuse in their lifetime. Finally, Chapman and Coups (1999) asked whether discount rates for monetary losses, described in terms of a parking fine, and for a flu-like illness, could explain uptake of free-of-charge influenza vaccinations. The respondents were future oriented, with 85% not significantly discounting the flu-like illness (i. e., showing zero time preference) and 83% not significantly discounting parking fines. Monetary time preferences were related to vaccine uptake: 45% of those with no discounting accepted the vaccine, compared with 29% of those who discounted money in the conventional manner. However, health discount rates were unrelated to vaccine acceptance. These findings suggest that hypothetical health discounting measures are an unreliable predictor of health behavior. The observed higher health discount rates in substance misusers compared with controls (Petry, 2003) and the trend toward higher health discounting in current cigarette smokers (Baker et al., 2003), taken together with the larger between-group differences in monetary discounting, suggest that health discounting does exhibit a weak relationship with health behavior, but is a less sensitive predictor than monetary discounting.
Can discount rates for monetary or appetitive outcomes predict health behavior?
Discounting for hypothetical health appears to be an unreliable predictor of observed health behavior. Nevertheless many studies in our search sample demonstrate that discount rates for money or other appetitive outcomes such as food or drug rewards, correlate with health behavior or health status. The key findings are discussed below, grouped by health outcome.
Discount rates and self-reported health.
Self-reported health is perhaps the most general health outcome measure and correlates with life-expectancy in the developed world (Idler and Benyamini, 1997). An early study (Fuchs, 1982), surveying approximately 500 adults in the US, and a more recent household survey in the Netherlands ( N = 2300) (Van Der Pol, 2018) both found weak negative correlations between monetary discount rates and self-reported health status. A further study in a South African population (Chao, 2009) found evidence for a U-shaped relationship between subjective health and monetary discount rates, whereby those who reported “average” health had lower discount rates than those who are either very healthy or very sick. The authors suggested that this might be due to the fact that those with very poor health were in more urgent need of money to fund medical care, whereas those in excellent health were able to enjoy the benefits of immediate economic consumption, highlighting the considerable difficulties in establishing a casual pathway between discounting and health.
Discount rates and cigarette smoking.
Monetary discount rates have consistently been shown to be higher in people who currently smoke tobacco than in non-smokers (Bickel et al., 1999; Mitchell, 1999; Odum et al., 2002; Reynolds et al., 2003, 2004, 2007a, b, 2009; Reynolds, 2006a; Fields et al., 2009; Rezvanfard et al., 2018; Stillwell and Tunney, 2018; Wing et al., 2018). A recent meta-analysis of studies comparing discount rates with addictive behaviors, (MacKillop et al., 2018) estimated a moderate and highly significant effect (Cohen's d = 0.57, p < 0.0001) across all studies comparing discount rates in smokers versus non-smokers. Monetary discount rates also correlate with smoking frequency (Epstein et al., 2003; Ohmura et al., 2005; Kang and Ikeda, 2018). In keeping with this infrequent smokers exhibit monetary discount rates intermediate between heavy smokers and non-smokers (Heyman and Gibb, 2006; Reynolds and Fields, 2018; Stillwell and Tunney, 2018 N = 9454; however see also Reynolds et al., 2003 and Johnson et al., 2007b for negative findings) and both smoking frequency and monetary discount rates were found to be higher in a group of young-adult smokers than in a group of adolescent smokers (Reynolds, 2004). The relationship between smoking frequency and discounting does not appear to be mediated by the acute effects of nicotine, since acute nicotine administration to non-smokers has recently been shown to have no effect on intertemporal choice behavior (Kobiella et al., 2018). However the relationship may be related to the level of nicotine dependence (Sweitzer et al., 2008), consistent with discounting being a state-based marker of addiction severity.
Interestingly those who have previously smoked and those who have never smoked do not significantly differ in their discounting behavior for money (Bickel et al., 1999). Furthermore in a prospective study of smoking cessation, participants were separated into a group who received an intervention program directed at reducing smoking and a control group who continued to smoke as usual. The two groups did not differ in their discounting behavior at baseline. Whilst the control group showed no changes in discounting over time, the intervention group (who overall reduced their smoking frequency) showed a significant decrease in discounting for both money and cigarettes after only 5 days into the program (Yi et al., 2008). These results strongly suggest that the state of nicotine addiction acts reversibly to increase discount rates. However there is also evidence to support the idea that discounting operates as an antecedent vulnerability marker, since monetary discount rates in smokers predict rates of smoking adoption (Audrain-McGovern et al., 2009). Monetary discount rates in smokers also predict rates of relapse within smoking cessation programs (Krishnan-Sarin et al., 2007; Yoon et al., 2007; MacKillop and Kahler, 2009; Sheffer et al., 2018; Brown and Adams, 2018), and the ability to abstain from smoking under laboratory conditions (Dallery and Raiff, 2007; Mueller et al., 2009). It seems reasonable to conclude therefore that relationships between discounting and smoking behavior are subject to both state - and trait-based influences (Odum, 2018).
Finally short-term abstinence from cigarettes increases discount rates in addicted smokers (Mitchell, 2004; Field et al., 2006; Yi and Landes, 2018). For example, Field et al. (2006) measured discount rates for hypothetical gains of money or cigarettes in a group of 30 smokers: one randomized group performed the procedures following their usual cigarette consumption, the other following a minimum of 13 h of abstinence from cigarettes. Implied discount rates for both money and cigarettes were significantly higher in the abstinence group.
In summary, monetary discount rates predict many features of smoking behavior: monetary discount rates are higher in current smokers, correlate with smoking frequency and prospectively predict the adoption of smoking and abstinence from smoking. The upward effect of nicotine cravings on discount rates and the decrease in discounting concomitant with reductions in smoking indicate that discounting is influenced by state-based environmental and motivational factors. Taken together these results indicate that relationships between discounting and smoking have both state - and trait-based components; a consideration which most likely also applies to relationships between discounting and other addictive behaviors (de Wit, 2008; Odum, 2018).
Discount rates and alcohol use.
Monetary discount rates exhibit consistent relationships with alcohol use. A recent meta-analysis of studies comparing discount rates in persons meeting clinical criteria of an alcohol dependence syndrome with controls (MacKillop et al., 2018) demonstrated a moderate, highly significant effect (Cohen's d = 0.50, p < 0.0001). Monetary discount rates are higher in currently abstinent alcohol dependent individuals compared with non-dependent controls (Bjork et al., 2004; Mitchell et al., 2005; Boettiger et al., 2007), in early-onset as opposed to late-onset alcohol dependence (Dom et al., 2006), and correlate with the severity of alcohol dependence (Mitchell et al., 2005), as well as symptoms of an alcohol abuse disorder (MacKillop et al., 2018). Monetary discount rates have also been shown to be higher in a group with a previous lifetime diagnosis of an alcohol abuse disorder as compared with those without a lifetime history of alcohol abuse (Bobova et al., 2009).
Several studies have further linked higher discounting with relatively moderate levels of alcohol consumption. Vuchinich and Simpson (1998) demonstrated higher monetary discount rates in “problem drinkers” and also heavy social drinkers, compared with light social drinkers, suggesting a relationship between alcohol intake and discount rates even amongst those designated as social drinkers. Similarly, Field et al. (2007) found that delay discounting for alcohol positively correlated with weekly alcohol consumption (Pearson r = 0.31) amongst adolescents, where those in highest tertile of alcohol use had a mean weekly consumption of 23 units, while those in the lowest tertile had a mean of 3 units. Yankelevitz et al. (2018) found that implied discounting for both money and alcohol was moderately correlated with levels of alcohol use in female students (Pearson r = 0.43 for money and 0.41 for alcohol discounting), though no correlation was found in male students.
Amongst students, monetary discount rates appear related to adverse consequences of alcohol use (Kollins, 2003; Rossow, 2008; Dennhardt and Murphy, 2018). Kollins (2003) observed that monetary discount rates were negatively correlated with age at first using alcohol and showed a strong positive correlation with the number of times that students had “passed out” as a result of alcohol use (Pearson r = 0.73, P < 0.01) and Rossow (2008), studying a sample of 17,413 adolescents in Norway, demonstrated that high monetary discounters became intoxicated more frequently and were more likely to vomit or “pass out” as a result of drinking. Paralleling findings in previous smokers of cigarettes, previously addicted users of alcohol who have achieved long-term abstinence have discount rates intermediate between current users and controls (Petry, 2001). Finally, in an elegant field study, the discount rates of male social-drinkers on their entry to a bar prospectively predicted increases in blood alcohol level on their exit (Moore and Cusens, 2018), such that those that had steeper discounting on entry showed greater increases in alcohol level. Discount rates were not confounded by baseline intoxication, since blood alcohol level at entry to the bar did not significantly predict the baseline discount rates.
In summary, despite a minority of studies reporting negative findings (Kirby and Petry, 2004; MacKillop et al., 2007; Fernie et al., 2018), monetary discount rates (and in some studies discount rates for alcohol) show robust relationships with alcohol intake over a wide range of usage, being higher in currently dependent individuals, where they correlate with the degree of dependence, and predicting use in non-dependent individuals.
Discounting and illicit substance misuse.
Amongst health behaviors, illicit substance misuse exhibits the most consistent relationships with discount rates. In an early study, heroin dependent individuals exhibited monetary discount rates twice those of non-drug-using controls (Kirby et al., 1999). Several other studies have also demonstrated significantly higher monetary discount rates in opioid-dependent individuals compared with controls (Madden et al., 1997; Kirby and Petry, 2004). Monetary discount rates are also higher in users of stimulant drugs such as cocaine and methamphetamine than in non-drug-using controls (Moeller et al., 2002; Coffey et al., 2003; Kirby and Petry, 2004; Heil et al., 2006; Hoffman et al., 2006; Monterosso et al., 2007; Johnson, 2018), with one study finding significantly higher monetary discount rates among individuals primarily using crack cocaine than among those primarily using heroin (Bornovalova et al., 2005). Indeed, MacKillop et al. (2018) estimated a large and highly significant aggregate effect (Cohen's d = 0.87, p < 0.0001) across studies comparing discount rates in dependent users of stimulant drugs versus controls and a moderate highly significant effect across studies comparing discount rates in opiate dependent individuals versus controls (Cohen's d = 0.76, p < 0.0001).
Consistent with nicotine abstinence studies, mild opioid deprivation in opioid dependent individuals increases discounting for money as well as heroin (Giordano et al., 2002), whereas those who have achieved longer term abstinence from heroin have lower discount rates than those currently addicted (Bretteville-Jensen, 1999; Kirby and Petry, 2004). By contrast abstinent formerly dependent cocaine users do not differ in discounting behavior from current users (Kirby and Petry, 2004; Heil et al., 2006) while Hoffman et al. (2006) found no relationship between length of abstinence and monetary discount rates in amphetamine dependent individuals. These findings suggest that either discount rates do not predict abstinence from stimulants, or that addiction to stimulants can have an irreversible effect to increase discount rates. Evidence against the former suggestion is that baseline discounting has been shown to predict the duration of abstinence from cocaine under a contingency management intervention (with low-incentives but not with high-incentives) (Washio et al., 2018).
Monetary discounting has not been consistently associated with concurrent cannabis use. Johnson et al. (2018) found that discount rates for hypothetical money in a group of marijuana dependent individuals did not differ from non-drug using controls, despite their study being adequately powered to detect any such difference (also see Stea et al., 2018). Similarly Heinz et al. (2018) found that monetary discounting did not correlate significantly with frequency of cannabis use over a 90 day period, although higher discounting was associated with younger age at first cannabis use. A recent study has shown that discount rates for hypothetical large monetary amounts ($1000) prospectively predicted abstinence outcomes amongst adolescents undergoing treatment for marijuana dependence (Stanger et al., 2018), more recent studies have (Heinz et al., 2018; Peters et al., 2018) found that discount rates did not predict response to a similar intervention in adults.
Finally studies have demonstrated an additive effect of smoking and alcohol use on discounting (Moallem and Ray, 2018; see also Andrade et al., 2018) but not of smoking and other forms of substance misuse (Businelle et al., 2018), and the combination of gambling problems and substance misuse appears highly predictive of impulsive choice (Petry and Casarella, 1999; Petry, 2001; however see Ledgerwood et al., 2009). In summary, with the exception of cannabis use, monetary discount rates consistently show strong correlations with the use of illicit substances.
Discounting, obesity, and eating behavior.
Researchers have examined relationships between obesity and discounting for both food and money outcomes, citing similarities between eating behavior and addiction. Obese children have been shown to choose immediate over delayed edible rewards more often than normal weight children, though the effects were small and not found for non-food rewards (Johnson et al., 1978; Bonato and Boland, 1983). Notably the ability to delay gratification for food rewards at aged four predicts the likelihood of being overweight at aged 11 (Seeyave et al., 2009). In addition, cross-sectional studies have examined links between measures of monetary discount and Body Mass Index (BMI) in adults, with mixed findings (Epstein et al., 2003; Borghans and Golsteyn, 2006; Nederkoorn et al., 2006; Reimers et al., 2009; Ikeda et al., 2018). In a large sample from the Netherlands financial proxies for the discount rate, for example reported under-saving or excessive expenditure, were significantly correlated with BMI, however measured discount rates themselves were not (Borghans and Golsteyn, 2006). Similarly, Ikeda et al. (2018) found that BMI was positively correlated with a survey measure of procrastination, but showed no correlation with monetary discount rates in a sample of 2987 Japanese adults. In group comparisons, obese women have been shown to exhibit significantly higher discount rates than healthy weight women (Weller et al., 2008), and people who smoke cigarettes who are also obese to exhibit higher rates than non-obese smokers (Fields et al., 2018). Davis et al. (2018) found that obese women with a binge-eating disorder, but not obese women without binge-eating disorder, had significantly higher monetary discount rates than normal weight women. It has been suggested that sensitivity to food rewards interacts with delay discounting, in support of which high discount rates predict palatable food intake amongst normal weight women who find palatable foods highly rewarding (Rollins et al., 2018), an effect which has been replicated in obese and overweight women (Appelhans et al., 2018). More recently Kulendran and colleagues found significantly higher monetary discount rates in obese adolescents compared with normal-weight adolescents (Kulendran et al., 2018a), and demonstrated that monetary discount rates in obese adolescents decreased over the course of a residential obesity intervention (Kulendran et al., 2018b). Taken together these studies suggest an emerging relationship between discounting and weight status, although further work is clearly required to establish whether particular aspects of eating behavior, such as caloric intake, or eating frequency show relationships with discounting.
Discount rates and preventive health behavior.
While some studies find relationships between discounting and preventive health behaviors, the findings are less consistent than for addictive behaviors. As described above, Chapman and Coups (1999) asked whether discount rates for monetary losses, as well as for a flu-like illness, could explain uptake of influenza vaccinations, with the finding that time preferences for money, but not illness, were related to vaccine uptake. In a later study (Chapman et al., 2001) monetary discounting showed an absent or very weak correlation with compliance with anti-hypertensive or cholesterol lowering medication. Similarly, a meta-analysis (Chapman, 2005) of 16 existing studies, including those described above, found no significant correlation between discounting and preventive health behavior (Mean Pearson r = 0.04, 95% CI = −0.01, 0.09). These studies suggest that in the population as whole preventive health behaviors show little or no relationship with discounting. However, two subsequent studies suggest that a subset of the highest discounters diverge from the rest of the population in their patterns of preventive health behavior. Firstly, Axon et al. (2009), studying 422 hypertensive adults, found that those in the highest quintile of monetary discount rates reported that they would be less likely to alter their diet and exercise plans to improve their future health. The highest discounters were not however significantly less likely to check their blood pressure or to follow their doctors' plans, as assessed by self-report. Secondly, Bradford (2018), analyzing discounting in 978 adults, found that for high discounting women the implied probability of attending mammography was reduced by 15.30% over the preceding 2 years and high discounting men had significantly lower rates of prostate examination (probability reduction 8.31%). The influence of discounting on attendance for cervical cancer screening was marginally significant. Across gender, high discounters were significantly less likely to have attended the dentist (probability reduction of 24.8%) or to have had any cholesterol testing (probability reduction 12.38%) or any influenza vaccination (probability reduction 11.05%) over the preceding 2 years. Additionally, high discounters were significantly less likely to be non-smokers or to have undertaken weekly vigorous activity. These studies suggest that monetary discount rates might be a useful tool for identifying groups at risk of failing to engage in preventive health practices.
Discount rates and risky sexual behavior or drug-taking practices.
Convergent evidence associates high monetary discount rates with behaviors that increase the risk of contracting sexually-transmitted or blood-borne viral infections. Individuals infected with hepatitis C exhibit higher rates of discounting than controls (Huckans et al., 2018), although the direction of causality cannot be established from this study. Higher discount rates are associated with needle-sharing amongst heroin users (Odum et al., 2000). Dierst-Daviese t al. (2018) found that a sample of homeless, men who abused substances and had sex with men, had higher discount rates than a control sample of men, deemed to be at lower risk of HIV, who had sex with men however had stable housing and did not abuse substances. Finally, Chesson et al. (2006) found relationships between monetary discounting and a spectrum of sexual behaviors and outcomes in a combined sample of university students and adolescents attending clinics ( N = 1042). For example, adolescents with higher discount rates were more likely to have had sexual intercourse before age 16 years, to have contracted gonorrhea or chlamydia, or to have become pregnant.
Discount rates and multiple health behaviors.
Two studies in our search sample compared discount rates with a broad range of health behavior. Firstly Daugherty and Brase (2018), collected data from 467 undergraduates on an inventory of health behaviors, namely tobacco, alcohol and drug use, number of visits to a doctor or dentist in the past year, exercise frequency, eating breakfast, seat-belt use when in a vehicle, motorbike or bicycle helmet use, and the use of sunscreen. They found that, in a two-step hierarchical regression analysis, a combination of delay discounting for hypothetical money and survey measures of time perspective explained a significant proportion of the overall variance in health behavior over and above the combination of the respondents' gender and their personality type (Costa and McCrae, 1990). At the level of predicting individual behaviors, the improvement in model fit achieved by adding the time preference measures at the second step was small (the largest improvement in R 2 was 0.05) but significant for all the behaviors above except helmet-wearing. Secondly Melanko and Larkin (2018) analyzed data from 72 young adults who performed both a discounting task with real monetary rewards and a hypothetical monetary discounting task as well as completing a Healthy Lifestyle Questionnaire (Corbin et al., 2006), assessing a variety of health behaviors, including smoking, alcohol use, substance misuse, physical activity, nutrition, avoiding destructive habits or practicing safe sex. In a hierarchical multiple regression, implied monetary discount rates for real rewards emerged as a significant predictor of the overall variance in health behavior. With regard to specific health behaviors, discounting for real monetary rewards emerged as a significant predictor of only smoking and nutrition scores. Notably however neither Daugherty and Brase (2018) nor Melanko and Larkin (2018) separated individuals by their level cigarette, alcohol or drug use. As a result the observed relationships between discounting and other behaviors may have been confounded by the effects of these addictive behaviors to increase impulsivity in other domains.
Conclusions: monetary discounting predicts unhealthy behavior.
Taken as a whole, the studies reviewed here support the hypothesis that high discount rates for money, and in specific instances food or drug rewards, are associated with many unhealthy behaviors. Furthermore the effect sizes reported compare favorably to existing social cognitive models of health behavior (Conner and Norman, 2005), establishing high discounting as a reliable correlate of unhealthy choice.
The majority of studies reviewed above are cross-sectional and are therefore indeterminate as to whether high discounting antecedes unhealthy behavior, or vice versa. Two longitudinal studies reviewed here demonstrate that monetary discounting can prospectively predict onset of unhealthy behavior or relapse after health behavior change (Yoon et al., 2007; Audrain-McGovern et al., 2009). However, there is also considerable evidence that discounting is influenced by state-based factors (Koffarnus et al., 2018). In addition, for addictive behaviors, the severity of addiction is positively correlated with discounting (for example Mitchell et al., 2005; Sweitzer et al., 2008; MacKillop et al., 2018), and discount rates have been shown to decrease following behavior change (Landes et al., 2018). These observations combine to suggest that discounting can be viewed as a concurrent marker of the extent of unhealthy behavior, rather than exclusively as an anteceding risk factor. Consistent with either interpretation, a growing number of studies have shown that monetary discounting predicts response to behavior-change interventions (Dallery and Raiff, 2007; Krishnan-Sarin et al., 2007; MacKillop and Kahler, 2009; Mueller et al., 2009; Washio et al., 2018; Sheffer et al., 2018; Stanger et al., 2018; Brown and Adams, 2018). Thus discounting has clear predictive utility and may allow health-behavior change interventions to be tailored to benefit at-risk groups. In addition, interventions may be targeted at modifying the cognitive mechanisms associated with discounting, which are assumed to contribute to unhealthy behavior. For example working memory training has been shown to both reduce discount rates and modify addictive behavior (Bickel et al., 2018).
However, in order for discounting to be upheld as a mechanism for unhealthy choice, the features of unhealthy behavior must also be consistent with the predictions of a particular model of discounting. We test this by examining the proposal that hyperbolic discounting can explain goal-incongruent unhealthy action. We conclude that hyperbolic discounting on its own has explanatory shortcomings, and might be usefully supplemented by a broader conceptual framework.
Does hyperbolic discounting explain goal-incongruent unhealthy actions?
The observation that hyperbolic functions consistently provide excellent fits to choices between delayed rewards has led to the suggestion that hyperbolic discounting may be a guiding computational principle of intertemporal choice (Ainslie, 1975, 2001). In particular, since the curvature of the hyperbolic function predicts myopic preference reversals (Figure ​ (Figure1), 1 ), hyperbolic discounting has been widely proposed as a an explanation for impulsive reward-seeking at the expense of long-term plans (for example Laibson, 1997; Ainslie, 2001; Angeletos et al., 2001; Bickel et al., 2007, 2018). Taken together with the observation that hyperbolic discount rates correlate with many forms of unhealthy behavior (Supplementary Tables 1‘5 indicate which of the above studies measured hyperbolic rates), hyperbolic discounting proffers to explain goal-incongruent unhealthy action. In this section we examine this hypothesis in more detail.
Evidence for and against hyperbolic discounting.
A key prediction of hyperbolic discounting is that, where a smaller-sooner reward is preferred over a larger-later reward, adding sufficient delay to both sooner and later options ought to shift preference toward the larger-later reward (see Figure ​ Figure1). 1). Several studies have demonstrated evidence for this in conventional monetary discounting tasks (for example Green et al., 1994; Kirby and Herrnstein, 1995). Such preference reversals have also been demonstrated in choices with health-relevant outcomes. For example, Read and Van Leeuwen (1998) asked people whether they would prefer to receive in 1 week's time either a healthy snack (such as a piece of fruit) or an unhealthy snack (such as a chocolate bar). The same individuals were followed up and 1 week later they were offered an immediate choice between a healthy and an unhealthy snack. Respondents chose healthy snacks more frequently when the choice was made in advance compared to when the snacks were immediately available. If healthy behavior is considered to carry larger-later health rewards, this result is consistent with the hyperbolic discounting of delayed health.
Despite the above findings, subsequent studies have challenged the hyperbolic model of preference reversal (Read, 2001; Read and Roelofsma, 2003; Sayman and Öncüler, 2009; Kable and Glimcher, 2018; Read et al., 2018). For example Kable and Glimcher (2018) found that discount functions based on a choice set in which all options were delayed by a fixed amount had the same hyperbolic curvature as those based on the same choice set in which the sooner option always occurred immediately: in contrast to conventional hyperbolic discounting, in which all outcomes are evaluated relative to the present time, this “as-soon-as-possible” function does not predict impulsive preference reversals. Additionally, several longitudinal studies have tested the predictions of hyperbolic discounting in real-time using monetary outcomes, with mixed findings (Ainslie and Haendel, 1983; Sayman and Öncüler, 2009; Read et al., 2018). The earliest of these studies found support for the preference reversals predicted by hyperbolic discounting. In this study Ainslie and Haendel (1983) asked participants on a Monday to choose between smaller amount of (hypothetical) money on to be received on Friday and larger amount to be received the following Monday. Participants were offered the choice again on the Friday, this time between receiving the smaller amount (for real) immediately or the delayed amount on the coming Monday. Consistent with hyperbolic discounting, the most common pattern was a preference for the larger-later amount when choices were made in advance, but for the smaller sooner amount when this was immediate. However subsequent studies have not replicated this finding. (Sayman and Öncüler, 2009) found the opposite result using a design similar to that of (Ainslie and Haendel, 1983). Furthermore a recent study performed over several weeks using real monetary rewards showed that preference reversals toward choosing smaller-sooner amounts (that is, in the direction predicted by hyperbolic discounting) were not significantly more common than those in the opposite direction (Read et al., 2018). Importantly this was the case despite the participants displaying hyperbolic discounting in conventional “cross-sectional” choices.
In summary, the preference reversals of the form predicted by conventional hyperbolic discounting have hitherto not been consistently demonstrated with monetary outcomes. This suggests that the preference reversals underlying health-related choices (such as those in Read and Van Leeuwen, 1998) may result from peoples' inability to predict in advance the impact of motivational and environmental states on their future decision-making.
Goal-incongruent actions often result from state changes.
The suggestion that intention-action discrepancies in health choice result exclusively from hyperbolic discounting (Ainslie, 2001) can also be questioned. Everyday experience suggests that people often abandon long-term plans in favor of immediate reward in response to environmental cues or changes in internal motivational state; for example, one might plan to abstain from eating dessert as part of a diet plan, but find it harder to resist when presented with a piece of cake (see for example Allan et al., 2018). Loewenstein (1996) has proposed that motivational drives and cues which elicit them, rather than hyperbolic discounting, are responsible for impulsive preference reversal. This idea is supported by evidence; for example, relapses in drug-taking behavior following abstinence commonly occur after exposure to a previous drug-taking environment (O'Brien et al., 1998). Hyperbolic discounting, to the extent that it applies as a model for preference reversal, may in some instances be sufficient to explain these cue-triggered behaviors, since cues provide information about the timing of outcomes, thereby signaling that reward is at hand. However, hyperbolic discounting does not appear necessary to explain these state-dependent influences. For example, in a study of analgesic preferences for childbirth (Christensen-Szalanski, 1984), women asked roughly 1 month in advance of labor preferred to avoid invasive spinal anesthesia in favor of less invasive but less effective pain relief methods, however during active labor women frequently reversed preference and opted for anesthesia. These findings are easily explained by an increase in the marginal utility for anesthesia during the painful state, which was not accurately predicted in advance, without reference to hyperbolic discounting.
In summary therefore not all forms of health-related preference reversal are consistent with hyperbolic discounting, and many preference reversals occur in response to changes in motivation or environmental states. Taken together with the lack of clear longitudinal evidence for the myopic preference reversals predicted by hyperbolic discounting, this suggests that any generative model of intertemporal health choice ought to be expanded beyond hyperbolic discounting alone to account for the effects of environmental cues. At best hyperbolic discounting alone provides no explicit framework either for incorporating the motivational information provided by environmental cues, or for how this information becomes associated with cues through learning. Existing models for these influences have not aimed to provide a mechanistic level of interpretation (Loewenstein, 1996). The following discussion advances a mechanistic framework based on the principles of reinforcement learning for understanding the effects of environmental cues on intertemporal health choice. Key to this account is the notion that cues previously associated with rewarding actions can trigger goal-incongruent habits, leading to preference reversal even in the absence of hyperbolic discounting. A full exploration of learning is beyond the scope of this review and we therefore restrict ourselves to the effects of cues after learning has taken place.
A reinforcement learning approach to cue-triggered preference reversal.
Reinforcement learning provides an approach to understanding intertemporal choice. Models of reinforcement learning posit that action control proceeds by estimating the expected value of ensuing reward over series of temporally connected future states, encapsulated in a state-action value function (Sutton and Barto, 1998); such models are therefore well placed to incorporate changes in state on choice behavior. Attempting to optimize value in changing environments can be considered a trade-off between flexibility in rapidly incorporating new information and the efficient use of past experience (Daw et al., 2005). This trade-off is embodied by two methods of learning action-value: a rather rigid, but computationally lean method, referred to as model-free, and a flexible, planning method capable of simulating future possible outcomes, often referred to as model-based (Gläscher et al., 2018; Daw et al., 2018; McDannald et al., 2018; Daw, 2018; Wunderlich et al., 2018; Dolan and Dayan, 2018). These systems reflect an established distinction in psychology between deliberative and automatic processes (Evans and Stanovich, 2018), but endow this with a normative and explicitly computational basis (Daw et al., 2005).
A model-based decision-maker is generally assumed to search through the possible future states consequent on each action. Model-based decision-making corresponds to the definition of “goal-directed” behavior in animal learning experiments as rapidly sensitive to changes in outcome value or the contingency between response and outcome (Colwill and Rescorla, 1986; Dickinson and Balleine, 1994; Balleine and Dickinson, 1998; Domjan, 2003). A model-free decision-maker, by contrast, through a gradual integration of outcome values encountered through experience, assigns a scalar estimate of long-run future value to taking an action in a particular state, without explicitly representing the corresponding future state of the world. The resulting cached action values are relatively insensitive to immediate changes in the outcomes. Instrumental behavior is initially goal-directed (model-based), but becomes increasingly model-free with learning, such that actions eventually become insensitive to changes in the value of the outcome, acquiring the characteristic of habits (Dickinson et al., 1995; Ouellette, 1998; Neal, 2006). This bears direct analogy to economic models of habit formation, which modify the instantaneous utility function to depend on past consumption (Becker and Murphy, 1988).
The differential engagement of these two systems has the potential to explain the environmental dependence of the preference reversals which underlie many forms of unhealthy behavior. While steep temporal discounting over the model-based valuation of future health would be expected to encourage the initiation of unhealthy behavior, with repetition, unhealthy behavior is likely to become increasingly model-free, or habitual. At this stage, even if the decision-maker re-evaluates their goals in favor of making healthy choices, cached action values will continue to encourage unhealthy choice in response to relevant environmental cues, leading to apparently impulsive preference reversals (intention-action discrepancies).
Environmental cues can trigger goal-incongruent habits.
Once a person has initiated an unhealthy behavior, they may later change their goals and form the intention to abstain from that behavior. For example, a person who smokes might decide to quit after being diagnosed with lung disease. However, if sufficient learning has taken place, the behavior might nevertheless be maintained as a stimulus-response habit under the dominant influence of cached action values. Thus, the smoker might find it particularly hard to resist when he or she spies the cigarette packet. As we outline below, the goal-incongruent influence of cached (habitual) action values can produce preference reversal, without invoking hyperbolic discounting. Furthermore, preference reversal can result even if each system in isolation exhibits exponential discounting and discounts the future at the same rate, a crucial distinction from dual-process models of quasi-hyperbolic discounting (Laibson, 1997; Angeletos et al., 2001; McClure et al., 2004; Bickel et al., 2018; Koffarnus et al., 2018). To demonstrate this formally, consider a decision-making agent for whom overall action value is a weighted average of the value from each controller, where both systems discount the future exponentially with a per period rate, γ (γ is the conventional symbol for the discount rate in reinforcement learning approaches; its meaning is equivalent to that of δ in Equation 1). Say, for example, the agent is a person following a diet plan who is choosing whether or not to consume a calorific biscuit when faced with a cue, the biscuit tin. A simplified (semi-Markov) state space for this decision is depicted in Figure ​ Figure2A. 2A . State B represents the presence of the biscuit tin. Consuming biscuits leads after a short delay, d c , to state C, which carries reward, R c , and after a longer delay, d h , to maintaining current weight, for simplicity here assigned a reward value of zero. Abstaining from biscuits leads, via the unrewarded state, A , to a health benefit in the form of weight loss, R h , after delay d h . Notably this is a radical simplification of reality. It is assumed that, while the model-based system is capable of making such simplifications based on declarative knowledge, the model-free system cannot, and has never experienced the health consequences. As a result, the model-free system has learned the values of each action in state B (termed “ Q - values”) based solely on the reward previously provided from consumption (Figure ​ (Figure2B). 2B ). It is assumed that the model-based system is initially naive to these cached values. Consider that the agent, after learning, is asked to make their decision when situated in state P, at some time delay, p in advance of state B, where cached values have no influence, and that here they are indifferent between indulging and abstaining, that is to say that the model-based value, Q MB , of consumption is equal to that of abstention:
Which simplifies to:
On reaching B, the presentation of the biscuit tin, cached action values are also “brought online,” incrementing the benefit of indulging, such that:
And therefore, by (3c):
Predicting a preference for indulging for ω < 1 (Figure ​ (Figure2C). 2C ). Therefore the presentation of the biscuit tin brings about a preference for sooner consumption. In economic terms, environmental cues such as the biscuit tin might be viewed as updating the utility of the immediately available option, by providing (previously inaccessible) information from prior experience.
The interplay between model-based and model-free systems in the account above bears some similarity to existing dual-systems models of intertemporal choice, which posit a deliberative planning system in opposition with an impulsive system. However, while the former are often mapped onto quasi-hyperbolic models of discounting (McClure et al., 2007, 2004), which combine two exponential discount functions with differing rates, here the two systems may share the same discount rate. Dynamic inconsistency can then result from the different sources of information available to either controller (also see Dayan et al., 2006). In particular, the state-dependent valuations of the cached system can explain why real-world preference reversals occur in response to learned cues and, unlike existing quasi-hyperbolic accounts, why these preference reversals become more prominent with the formation of habits. In addition, unlike existing dual-process accounts, reinforcement learning models can explicitly model the learning process generating incongruent preferences. (A direct treatment of learning is beyond the scope of this review).
Exponential discounting is used here to illustrate that hyperbolic discounting is not necessary to predict preference reversals, although clearly hyperbolic discounting is more consistent with cross-sectional intertemporal choice data than exponential discounting. The framework above could readily incorporate hyperbolic discounting, and several authors have demonstrated reinforcement learning models which produce hyperbolic discounting (Daw and Touretzky, 2000; Tsitsiklis and Van Roy, 2002; Kalenscher and Pennartz, 2008; Kurth-Nelson and Redish, 2009; Alexander and Brown, 2018).
Model-based and model-free interactions in addiction.
The relative contributions of model-based and model-free strategies might in part explain why discount rates correlate particularly strongly with addictive behaviors (Keramati et al., 2018; Lucantonio et al., 2018). Steep discount rates would putatively favor initial goal-directed drug-seeking behavior (as with other forms of unhealthy behavior). The high rewards provided by substances of abuse might then lead to rapid habitization of drug taking behavior by comparison with other repeated behaviors (Everitt and Robbins, 2005; Everitt et al., 2008; Lucantonio et al., 2018), effectively binding impulsive individuals to their initial choices. In addition, repeated choice of immediately available rewards by individuals with high discount rates would be expected to lead to these individuals acquiring habits more rapidly (by more reinforced choices). In support of this, animal studies of addiction demonstrate that rats bred to exhibit steeper delay discounting more rapidly acquire compulsive self-administration of cocaine than their low discounting counterparts (Belin et al., 2008). Finally, chronic addiction may further shift responding toward model-free control (Keramati et al., 2018), in part by damaging frontal cortical areas on which model-based valuations are thought to depend (Rogers and Robbins, 2001; Gläscher et al., 2018; Camchong et al., 2018; Smittenaar et al., 2018), further decreasing the capacity to exert model-based control over goal-incongruent habits. Although these mechanisms most likely play in a role in addiction, there is an ongoing debate as to their precise contribution, and in particular the interplay between habitual mechanisms and classical (Pavlovian) conditioning in addictive disorders (Everitt and Robbins, 2005).
Future directions.
The studies reviewed here indicate that discounting is a promising predictor of health behavior, however hyperbolic discounting is challenged as an explanation for the discrepancy between intentions and actions in health choice, and a framework based on the trade-off between model-free and model-based action control appears better placed to incorporate the influences of environment and learning. Nevertheless the study of myopic health-related decision-making remains nascent. Further work is required firmly establish discounting as a predictive tool, to extend the measurement framework and to develop novel interventions capable of reducing goal-incongruent health choice.
Discounting as an individual difference measure.
The endeavor to predict and understand health behavior through comparison with discounting measures forms part of a wider paradigm to characterize individual differences in field behavior using decision-making tasks (Montague et al., 2018). A question relevant to this endeavor is the extent to which discounting can be considered as a either a personality trait or a state variable (de Wit, 2008; Odum, 2018; Bickel et al., 2018). Personality traits are defined as stable and enduring characteristics, reflecting a general tendency to respond in a given manner under given circumstances and can be seen to represent persistent patterns of internal states (see Costa and McCrae, 1990). State variables by contrast vary over a shorter time scale, such that their rank ordering between individuals may be altered with changes in the motivational state of the respondents and/or the elicitation conditions (Kraemer et al., 1993).
Several pieces of evidence reviewed here demonstrate that discounting has a state-based component. Firstly studies comparing the discounting of hypothetical health with that of money find that discounting varies with the domain and valence of the outcome, in a manner that changes the rank ordering of discount rates between individuals. Secondly discount rates amongst substance misusers are greater in a state of drug-craving than in a drug-sated state. Thirdly, even under conditions of drug-satiety, addiction appears to be associated with a reversible increase in discounting. These findings are supported by a wealth of additional evidence showing that discounting can be manipulated through contextual framing (see Koffarnus et al., 2018 for a review). Indeed from a normative perspective it is sensible for agents to adjust their tolerance of delay to match environmental conditions; for example steep discounting is adaptive in an environment where delayed rewards are highly uncertain to be received (see Lahav et al., 2018).
However there is also evidence that discounting has attributes of trait variable. The test-retest reliability of monetary discounting is substantial at intervals of up to 1 year (Pearson r = 0.71; Kirby, 2009) and across different elicitation methods (Odum, 2018). Furthermore, whilst monetary discounting is poorly correlated with hypothetical health discounting across individuals monetary discount rates are strongly and significantly correlated with other forms of appetitive outcome, such as the discounting of cigarettes for cigarette smokers, the discounting of heroin for opioid-dependent outpatients and the discounting of food amongst college students (Odum, 2018; Pearson r = 0.93; p = 0.0007 for money versus the mean of all other outcomes). There is also evidence that discounting is heritable (see MacKillop, 2018 for a review). A recent longitudinal twin study estimated the heritability of delay discounting in adolescence at up to 50% (Anokhin et al., 2018), rats and mice can be bred to exhibit greater degrees of delay discounting (e. g., Anderson and Woolverton, 2005; Belin et al., 2008), and steeper discounting in humans is associated with specific polymorphisms related to dopamine signaling (Eisenberg et al., 2007). Also commensurate with discounting as an enduring trait, steeper discounting is associated with lower socio-economic status (e. g., Bradford, 2018; Anokhin et al., 2018). In summary, discounting for appetitive outcomes has features of a trait marker.
Trait-level differences in discounting can be viewed as long-term adaptations to prevailing environmental conditions, shaped either through learning or inheritance (this notion is consistent with a branch of evolutionary theory termed Life History Theory; Del Giudice et al., 2018; Del Giudice and Ellis, 2018). An important direction for future research will be to examine the relative contributions of genes and childrearing environment to discounting. The study of self-regulation in developmental psychology has adopted this approach; for example children who experience emotionally close, sensitive, and responsive caregiving have been found to exhibit higher levels of self-regulation (Belsky et al., 2007), and Berry et al. (2018) find evidence that self-regulation ability appears to be more sensitive to early childcare experiences in a group with a particular dopamine receptor polymorphism. Furthermore low childhood self-regulation has been prospectively related to poorer health outcomes later in life (for example Francis and Susman, 2009; Seeyave et al., 2009; Moffitt et al., 2018). Future research into delay discounting would benefit from a similar developmental perspective to better understand the origins of trait-level individual differences.
In conclusion discount rates are far from immutable, and are sensitive to environmental and motivational conditions. However, discounting for appetitive outcomes is stable across individuals when measured under similar conditions, is partly heritable and is associated with a range of similar constructs, and as a result has the potential to provide an endophenotype which mediates between genetic influences, more fundamental neuro-computational processes and maladaptive patterns of impulsive behavior in the real-world (MacKillop, 2018). Further work is required to more completely characterize the relationships between these levels of analysis. We have proposed that discounting is best considered within a broader framework for understanding choice between temporally extended outcomes, based on the theory of reinforcement learning. We have shown how the interaction between model-based and model-free value estimates may contribute to real-world instances of goal-incongruent unhealthy choice. However several important questions remain largely unanswered. For example, can the balance of model-based versus model-free control be measured, and can such measures be used to predict health-related behavior? Is there a trait component to this balance? What is the relationship between model-based control and measured discount rates, or related metrics of self-regulation? We briefly address these questions in turn below.
Measuring model-based and model-free interactions.
One approach to measuring the interaction between model-based and model-free decision-making is to directly observe the acquisition of habitual behavior through repeated training on a given laboratory task. Here, the rate of acquisition of habitual responding may offer a novel measure for predicting field behavior. Using this approach, outcome-insensitive habits have been demonstrated in humans (Tricomi et al., 2009), providing a behavioral counterpart to studies of habit formation which measure subjective automaticity (Lally et al., 2018). An important aim for future studies will be to examine habitual or cue-triggered preference reversals in real-time. Along these lines, subjective measures of habitual automaticity in relation to smoking behavior have been shown to predict goal-incongruent smoking-related responses (Orbell and Verplanken, 2018). However, observing habit learning directly is time-consuming. Recent human studies (Gläscher et al., 2018; Daw et al., 2018; Eppinger et al., 2018; Smittenaar et al., 2018) have used a paradigm with a probabilistic tree structure which separates model-free and model-based control, before habitization has taken place, depending on whether respondents incorporate the transition structure of the task into their learning (model-based) or learn solely based on the reinforcement obtained in each discrete state (model-free). Humans performing this task generally exhibit some combination of the two modes of control, and the relative contribution of the two strategies may provide a novel behavioral marker. Further studies are required to establish the longitudinal stability of these measures, and whether they have a trait component, as well as to examine their relationship with habitual behavior in the field.
Relationships between model-based control, discounting and self-regulation.
Responding on discounting paradigms cannot easily be considered habitual, and most likely requires model-based processes. Nevertheless, we propose that directly representing outcomes during choices on discounting paradigms, rather than relying on a low-level tradeoff between amount and delay, is likely to be associated with more future-oriented responses. In line with this suggestion, mentally simulating future outcomes decreases measured discount rates (Peters and Büchel, 2018) and lesioning neural structures on which this simulation process depends, such as the hippocampus (Hassabis et al., 2007; Johnson et al., 2007a; Schacter et al., 2008) increases discounting (Mariano et al., 2009). Furthermore, existing studies suggest that the choice of delayed rewards, model-based control and working memory engage overlapping neural substrates: neuroimaging studies have found that the dorsolateral prefrontal cortex (dlPFC) is activated in both model-based learning (Gläscher et al., 2018), and in choosing delayed rewards on intertemporal choice paradigms (McClure et al., 2004, 2007), while disrupting this area (using either transcranial magnetic stimulation or transcranial direct current stimulation) both decreases model-based behavior (Smittenaar et al., 2018) and increases temporal discounting (Hecht et al., 2018). A recent study has also demonstrated that in younger adults, but not in older adults, a greater degree of model-based behavior is associated with higher working memory capacity (Eppinger et al., 2018). The finding that working memory training decreases discounting among substance misusers (Bickel et al., 2018) is especially interesting in this regard. A possible unifying interpretation is that explicitly representing the future consequences of action, a process associated with model-based decision-making, produces more future-oriented choice and hence lowers discount rates (see also Peters and Büchel, 2018) and that this process is also limited by working memory capacity. Notably the exercise of model-based control is similar to existing definitions of self-regulation, as “the largely (but not exclusively) volitional act of managing attention and arousal in a manner that facilitates goal-directed behavior” (Berry et al., 2018, p. 2). An advantage of the reinforcement learning approach is its ability to formalize such behavior within a normative computational framework. It is important to reiterate here that, whilst we view model-based valuations as supporting future-oriented choice, we do not identify the model-free controller with an “impulsive system.” In our view both controllers share the same fundamental goals and it is the relative inflexibility of model-free decision-making which gives its responses their short-sighted character (Dayan et al., 2006).
Novel behavioral predictors and interventions.
Several additional approaches may yield novel behavioral markers of unhealthy choice. Unlike the naïve decision maker described above, people often demonstrate that they can predict their future tendencies, termed sophistication , for instance by choosing paths that remove their opportunity to make myopic choices, an activity referred to as pre-commitment (Ainslie, 2001; Ariely and Wertenbroch, 2002; Prelec and Bodner, 2003). For instance, a person attempting to abstain from smoking might throw away their cigarette packets. Pre-commitment would be expected to obscure real-world relationships between discount rates and myopic behavior, since at least a subset of sophisticated steep discounters would exhibit far‘sighted real‘world choices. Furthermore, within the model-based versus model-free framework above, we propose that the ability to predict and therefore pre-empt the influence of state changes on one's behavior is a key substrate of self-control. Economic theories of pre-commitment, often based on quasi-hyperbolic discounting, provide a useful conceptual framework for predicting the effects of varying degrees of sophistication on behavior (O'Donoghue and Rabin, 2003), and computational models of these processes (Kurth-Nelson and Redish, 2018) offer the potential to enrich predictions of health behavior.
An additional important influence not considered above is the effect of internal motivational states. The effects of motivational state on habitual responding are complex, having immediate effects on the vigor of responding, while having effects on choice by altering the utility of outcomes (see Niv et al., 2007). Models which formalize these effects remain the subject of ongoing theoretical work, though may eventually provide a valuable substrate for applied health behavior research.
Although not discussed in detail here, since many health-promoting behaviors are to a degree aversive, measures of dread (Berns et al., 2006; Story et al., 2018) might form a predictor of engagement in such behaviors. A complexity tending to preclude clear a priori predictions in this area is that, if dreading aversive health-promoting behaviors were to promote their avoidance (e. g., Kleinknecht, 1978), then dreading illness would be expected to have the opposite effect, promoting engagement in such behaviors. Perhaps reflecting these competing influences, Chapman and Coups (1999) report that rates of vaccination uptake were not significantly higher in individuals with negative time preference for illness, as compared to individuals with positive time preference for illness.
Finally, the account above focuses on instrumental learning. However, there is also evidence that animals use state-state, as well as state-action-state, associations to guide action. This third mode of learning, embodied by classical conditioning, is referred to as Pavlovian learning (Domjan, 2003). Based on state-state predictions, the Pavlovian controller initiates stereotyped actions directed toward obtaining predicted rewards. Crucially, unlike instrumental control, Pavlovian actions are initiated regardless of whether or not they lead to reward (Williams and Williams, 1969). The precise contributions of instrumental and Pavlovian effects to real-world choices are difficult to distinguish. Nevertheless the mechanism of choice inconsistency proposed above for the case of model-based and model-free interactions would remain largely equivalent for the case of interactions between model-based and Pavlovian decision-making (Dayan et al., 2006). An advantage of the reinforcement learning approach is its ability to generate simulations of these interactions over the course of learning and such models may yield parameters capable of explaining further variance in health behavior.
Novel interventions might be directed at specific constructs within the above framework, and indeed several existing health behavior interventions can be viewed in this manner. For example strategies aimed at making healthy choices habitual are already known to be effective (Lally et al., 2007). There is an urgent requirement for novel interventions capable of reducing goal-incongruent unhealthy choice, since the increasing burden of disease attributable to unhealthy behavior is placing unsustainable demands on existing healthcare systems (Smith et al., 2018). We propose that the identification of health decision-making phenotypes will play an important role in evaluating and optimizing the necessary interventions.
Author contributions.
All authors (Giles W. Story, Ivo Vlaev, Ben Seymour, Raymond J. Dolan, and Ara Darzi) contributed to the conception of the work. GS performed systematic review and drafted the work. All authors (Giles W. Story, Ivo Vlaev, Ben Seymour, Raymond J. Dolan, and Ara Darzi) were involved in revising the work critically for important intellectual content. All authors (Giles W. Story, Ivo Vlaev, Ben Seymour, Raymond J. Dolan, and Ara Darzi) gave final approval to the version of the manuscript being published and agree to be accountable for all aspects of the work.
Conflict of interest statement.
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Agradecimentos.
We would like to thank Zeb Kurth-Nelson and Peter Smittenaar for their helpful comments. This work is supported by a Wellcome Trust Senior Investigator Award to Raymond J. Dolan (098362/Z/12/Z). The Wellcome Trust Centre for Neuroimaging is supported by core funding from the Wellcome Trust 091593/Z/10/Z.

No comments:

Post a Comment